COMPOSITIONS AND METHODS FOR DETECTING LIGAND-DEPENDENT
NUCLEAR RECEPTOR AND COACTIVATOR INTERACTIONS

TECHNICAL FIELD

The invention relates to the fields of molecular genetics and pharmacology. The
invention provides methods and compositions for determining the capability of a compound, a
macromolecular species, or a stimulatory effector to produce a conformational change in a
predetermined nuclear receptor and/or its accessory protein(s), typically to agonize or antagonize a
ligand-induced activation of the nuclear receptor. An aspect of the invention can provide a means for
identifying agents which are pharmacological agonists or antagonists for one or more predetermined
nuclear receptor species. An aspect of the invention relates to a method of rank-ordering a set of
compounds with respect to each compound's ability to affect an interaction between one or more
nuclear receptor species and a plurality of coactivator and/or corepressor species or similar interfaces;
the rank-ordering provides a dataset for identifying pharmacologically active agonists, antagonists,
partial agonists, potentiators, and the like.

BACKGROUND OF THE INVENTION 

A variety of nuclear receptors exist in animal cells and generally function to effectuate
transcriptional regulation of one or more subsets of regulable genes. Most often, the nuclear receptor
exhibits a high-affinity binding interaction with one or more species of hydrophobic ligand. These
ligand binding interactions can produce a conformational change in the nuclear receptor. The
conformational change induced by ligand binding modifies the ability of the nuclear receptor to interact
with certain specific receptor-binding DNA sequences and/or to interact with other nuclear proteins
(e.g., transcription factors, coactivators, corepressors), so as to modulate the transcription of genes
having the specific receptor-binding DNA sequence(s) located so as to influence the transcription of
the gene. In one model of steroid receptor action, a ligand-induced conformational change in a
ligand-binding domain unmasks a DNA-binding activity in another structural domain of the steroid
receptor protein. In the absence of this ligand-induced conformational change, the ligand-binding domain
represses the DNA-binding activity of the linked structural domain. It has been recently shown that
one or more superfamilies of proteins, termed "coactivators" and "corepressors", respectively, interact
with nuclear receptors in a ligand-dependent fashion so as to effect transcriptional activation
(coactivators) or so as to inhibit or silence transcription (corepressors) of genes which are
transcriptionally modulated by nuclear receptors.

Nuclear hormone receptors comprise a superfamily of over 40 transcription factors.
About half of them are classical receptors for lipophilic ligands such as steroids and vitamins. The
nuclear hormone receptor gene superfamily encodes structurally related proteins that regulate
transcription of target genes. These macromolecules include receptors for steroid and thyroid
hormones, vitamins, retinoids, fatty acids, and other nuclear receptor proteins for which no ligands
have been found, so-called "orphan receptors". These receptors have modular domains with readily
identifiable structural features and sequence motifs. The DNA-binding domain ("DBD") directs the
receptors to bind specific DNA sequences as monomers, homodimers, or heterodimers. The
ligand-binding domain ("LBD") responds to binding of the cognate hormone; this domain and the
amino terminal domain interact with other transcription factors, and with the coactivators and/or
corepressors. Nuclear receptor-specific actions are derived from a combination of diverse elements,
including availability of ligand, receptors, and nonreceptor factors; target-site structure; and
interactions with other proteins, such as the general transcription factors and, very importantly,
the coactivator and/or corepressor proteins.

The steroid/thyroid hormone receptor superfamily of ligand-activated transcription
factors encompasses not only the receptors for steroids, thyroid hormone, retinoids and vitamin D, but
also a large number of proteins whose functions and/or ligands are unknown and which are thus
termed orphan receptors. This family of transcription factors integrates signals from ligands as well as
from signal transduction pathways, resulting in alterations in mRNA and protein expression that are
unique to the complex signals received. These nuclear receptors are implicated in the control of a
wide range of physiological responses and homeostatic conditions, including cell differentiation,
neoplasia, control of cellular metabolism, and neurological function. For a review of the steroid
hormone receptor superfamily, see Ribiero RC (1995) Annu. Rev. Med. 46: 443-453.

There has been substantial interest in identifying ligands which interact with nuclear
receptors and modulate the biological effects mediated by these nuclear receptors. Such ligands,
whether agonistic or antagonistic to natural physiological ligands of the receptors, would serve as
candidate pharmaceuticals for controlling the biological effects of nuclear receptor-mediated
transcriptional control and the attendant physiological effects produced thereby. Unfortunately, most
conventional assays for identifying potential ligands rely upon the use of libraries of radiolabeled
compounds which are tested for their binding coefficient (e.g., via Scatchard analysis) to a purified
nuclear receptor species. It is difficult and labor-intensive to obtain such libraries of radiolabeled
compounds and then screen the library using binding assays. Furthermore, it has been found that a
compound's binding constant is not necessarily predictive of its biological activity as a ligand. As a
better proxy for ligand function, transcriptional assays have been developed to assay for
ligand-induced transcriptional activation of a nuclear receptor as detected by transcription of a reporter
sequence operably linked to a nuclear receptor response element and promoter.

Unfortunately, many of the transcriptional responses generated by ligand-activated
nuclear receptors can be subtle and are frequently difficult to detect and/or quantify by conventional
transcriptional assay procedures, which are relatively insensitive for monitoring expression of genes
which are not abundantly transcribed. Furthermore, many of the conventional transcription assay
procedures are difficult to perform and entail problematic steps, such as requiring lysis of the cells
being assayed. It is desirable to have a method for detecting ligands of predetermined nuclear
receptors with high specificity, sensitivity, and selectivity. In particular, methods to reduce readout
background noise that can obscure legitimate signals would find great use in the art for identifying
novel pharmaceutical agents that are nuclear receptor ligands, as well as providing sensitive assays
for detection and quantitation of ligands which are environmental pollutants that activate nuclear
receptors (e.g., TCDD).

Moreover, many nuclear receptor ligands, particularly steroids and steroid-like
compounds, often exhibit pleiotropic biological effects through nuclear receptors. For example, both
estradiol and tamoxifen bind to the estrogen receptor, but each compound can produce different
biological effects and transcriptional profiles (i.e., the set of genes which are transcriptionally
modulated by ligand presence) depending upon the tissue and cell type involved.

Thus, there exists a need in the art for methods to efficiently identify agents which
modulate nuclear receptor function. It is important that such methods have the necessary levels of
sensitivity and specificity for identifying bona fide nuclear receptor agonists and/or antagonists. The
present invention fulfills these and other needs in the art.

The references discussed herein are provided solely for their disclosure prior to the
filing date of the present application. Nothing herein is to be construed as an admission that the
inventors are not entitled to antedate such disclosure by virtue of prior invention.

SUMMARY OF THE INVENTION

In accordance with the foregoing, in one aspect of the invention are provided methods
for identifying agents which are agonists or antagonists for ligand-induced activation of a
predetermined nuclear receptor. There are a variety of formats of the invention, and a plurality of
formats may be multiplexed sequentially or in parallel to produce a discrimination profile of a test
agent in order to categorize the pharmacological nature of the test agent (e.g., agonist, antagonist,
partial agonist, mixed agonist/antagonist, etc.), so as to enable dissecting the pleiotropic biological
effects of nuclear receptor ligands and correlating the individual effects with chemical structure.

Positive Hybrid System: Coactivators

The methods of the invention typically employ a positive hybrid nuclear receptor
signal transduction system, which typically comprises an intact eukaryotic host cell, comprising: (1) an
LBD-TRX polynucleotide sequence encoding and expressing a ligand-activatable fusion protein
(referred to herein as "LBD-TRX") which comprises a ligand binding domain ("LBD") of a
predetermined nuclear receptor covalently linked, typically in polypeptide linkage, to a transcriptional
activator domain ("AD") or alternatively to a DNA-binding domain ("DBD") of a predetermined
transcription factor (generically, AD and DBD are referred to as "TRX" for convenience), (2) a CA-TRX
polynucleotide sequence encoding and expressing a coactivator fusion protein (referred to herein as
"CA-TRX") which comprises a domain of a nuclear receptor coactivator protein ("CA") capable of
binding to said LBD of said predetermined nuclear receptor linked, typically in polypeptide linkage, to a
transcriptional activator domain or alternatively to a DNA-binding domain of a predetermined
transcription factor (collectively "TRX"), and (3) a reporter polynucleotide sequence comprising, in
linear order, a transcriptional regulatory sequence which is responsive to said predetermined
transcription factor and a reporter cassette encoding a sequence that confers a signal or detectable
phenotype. The TRX components of the LBD-TRX protein and the CA-TRX protein should be
functionally complementary; i.e., if the TRX of one fusion is a DBD, the TRX of the other fusion is an
AD, and vice versa. Prior to contacting of the system with an agent which is an agonist ligand of the
LBD, the transcriptional activity of the LBD-TRX and CA-TRX with respect to the reporter
polynucleotide is substantially absent and the system substantially lacks functional expression of an
encoded sequence of the reporter polynucleotide. Subsequent to contacting of the system with an
agonist agent which produces an activated conformation of the LBD in the LBD-TRX fusion protein,
the CA-TRX fusion protein functionally associates, forming an LBD-TRX/CA-TRX complex that is
transcriptionally active with respect to the reporter polynucleotide, whereupon the reporter cassette
can be transcribed and functionally expressed under the control of the transcriptional regulatory
sequence. Thus, functional expression of the reporter cassette serves to report whether the system
has been contacted with an agent which induced an activated conformation of the LBD.
Advantageously, in one variation, the reporter cassette encodes a cell surface reporter protein which
can be detected and/or selected for on the basis of its presence on the surface of a cell membrane. In
embodiments where the nuclear receptor signal transduction system consists of metabolically active
cells having a polynucleotide sequence encoding the LBD-TRX fusion, the CA-TRX fusion, and a
reporter polynucleotide sequence, the system is referred to as a CA-TRX reporter cell.
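
The readout logic of the positive hybrid system described above can be sketched as a small Boolean model. This is an illustrative simplification only, not part of the disclosed methods: the class name `Fusion` and the functions `trx_complementary` and `reporter_expressed` are hypothetical, and a real assay yields graded rather than on/off reporter expression.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fusion:
    """Hypothetical stand-in for an LBD-TRX or CA-TRX fusion protein."""
    partner: str  # "LBD" or "CA"
    trx: str      # "AD" (activator domain) or "DBD" (DNA-binding domain)


def trx_complementary(a: Fusion, b: Fusion) -> bool:
    """True when one fusion carries a DBD and the other carries an AD."""
    return {a.trx, b.trx} == {"AD", "DBD"}


def reporter_expressed(lbd: Fusion, ca: Fusion, lbd_activated: bool) -> bool:
    """Reporter is on only if the TRX halves are complementary AND the
    LBD is in an activated (agonist-bound) conformation that recruits
    the coactivator fusion."""
    return trx_complementary(lbd, ca) and lbd_activated


if __name__ == "__main__":
    lbd = Fusion("LBD", "DBD")
    ca = Fusion("CA", "AD")
    print(reporter_expressed(lbd, ca, lbd_activated=False))  # no agonist
    print(reporter_expressed(lbd, ca, lbd_activated=True))   # agonist present
```

The model makes the complementarity requirement explicit: two fusions that both carry a DBD (or both carry an AD) cannot reconstitute a functional transcription factor, whatever the ligand state.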

A nuclear receptor signal transduction system of the present invention can be used to
evaluate one or more test agents for their ability to activate the LBD of a predetermined nuclear
receptor. Typically, the ability to activate the LBD indicates an agonistic activity as referenced to the
physiological ligand of the nuclear receptor. In this aspect of the invention, a test agent is applied to a
nuclear receptor signal transduction system and incubated for a suitable incubation period anticipated
to be substantially sufficient for the agent to transduce a signal to the LBD. After the incubation
period, the expression of the reporter cassette is determined by detecting the presence of the reporter.
Test agents which produce a statistically significant increase in reporter as compared to background
(e.g., placebo) are scored as receptor agonists. In embodiments where the nuclear receptor signal
transduction system comprises a metabolically active intact reporter cell, typically a population of
reporter cells is contacted with a test agent, and the ability of the test agent to function as a receptor
agonist is determined by the relative expression of detectable reporter above background as
compared to a reference population of reporter cells under substantially equivalent conditions in the
absence of said test agent. Dose-response data can be generated in this manner using a variety of
different concentrations of the test agent.
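
As an illustration of scoring a test agent by a "statistically significant increase in reporter as compared to background", the following sketch applies a one-sided exact permutation test to replicate reporter readings. The choice of test, the 0.05 threshold, the function names, and the sample values are all assumptions made for this example; the text does not prescribe a particular statistical procedure.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, background):
    """One-sided exact permutation test.

    Returns the fraction of all label assignments whose difference in
    means (treated minus background) is at least the observed one.
    """
    pooled = list(treated) + list(background)
    k = len(treated)
    observed = mean(treated) - mean(background)
    total = 0
    extreme = 0
    for idx in combinations(range(len(pooled)), k):
        chosen = set(idx)
        group_a = [pooled[i] for i in chosen]
        group_b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance guards against floating-point round-off.
        if mean(group_a) - mean(group_b) >= observed - 1e-12:
            extreme += 1
    return extreme / total


def score_agonist(treated, background, alpha=0.05):
    """Score an agent as an agonist if reporter signal is significantly
    above background (alpha is an assumed, conventional threshold)."""
    return permutation_p_value(treated, background) < alpha


if __name__ == "__main__":
    placebo = [1.0, 1.2, 0.9, 1.1]     # hypothetical reporter readings
    test_agent = [9.1, 8.7, 9.4, 8.9]  # hypothetical reporter readings
    print(score_agonist(test_agent, placebo))
```

Repeating the same comparison at several agent concentrations gives the dose-response data mentioned above. The exact enumeration is practical only for small replicate counts, since the number of label assignments grows combinatorially.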

In a variation, the method may be used to identify antagonists of a predetermined
nuclear receptor LBD by determining the ability of a test agent to produce a statistically significant
reduction in the expression of the reporter by inhibiting the capacity of a unit dose of a predetermined
activating ligand of the LBD to produce the amount of expression of the reporter as occurs in the
absence of the test agent.
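
The antagonist variation compares reporter expression produced by a unit dose of activating ligand with and without the test agent. A minimal sketch of that comparison, expressed as percent inhibition, follows; the function name, the background-subtraction convention, and the example numbers are assumptions for illustration, and statistical significance would still need to be assessed on replicates.

```python
def percent_inhibition(ligand_only, ligand_plus_agent, background=0.0):
    """Percent reduction of the ligand-induced reporter signal.

    ligand_only       -- reporter signal with the unit dose of activating ligand
    ligand_plus_agent -- reporter signal with the same dose plus the test agent
    background        -- reporter signal with neither ligand nor agent
    """
    window = ligand_only - background
    if window <= 0:
        raise ValueError("activating ligand produced no signal above background")
    return 100.0 * (ligand_only - ligand_plus_agent) / window


if __name__ == "__main__":
    # Hypothetical readings in arbitrary reporter units.
    print(percent_inhibition(ligand_only=120.0, ligand_plus_agent=70.0, background=20.0))
```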

Positive Hybrid System: Corepressors

The invention can also employ a composition and method for identifying ligands
which induce binding of a corepressor to the LBD. These systems typically employ a positive hybrid
nuclear receptor signal transduction system (a "CR-TRX" system), which typically comprises an intact
eukaryotic host cell, comprising: (1) an LBD-TRX polynucleotide sequence encoding and expressing a
ligand-activatable fusion protein (referred to herein as "LBD-TRX") which comprises a ligand binding
domain ("LBD") of a predetermined nuclear receptor covalently linked, typically in polypeptide linkage,
to a transcriptional activator domain ("AD") or alternatively to a DNA-binding domain ("DBD") of a
predetermined transcription factor (generically, AD and DBD are referred to as "TRX" for
convenience), (2) a CR-TRX polynucleotide sequence encoding and expressing a corepressor fusion
protein (referred to herein as "CR-TRX") which comprises a domain of a nuclear receptor corepressor
protein ("CR") capable of binding to said LBD of said predetermined nuclear receptor linked, typically
in polypeptide linkage, to a transcriptional activator domain or alternatively to a DNA-binding domain of
a predetermined transcription factor (collectively "TRX"), and (3) a reporter polynucleotide sequence
comprising, in linear order, a transcriptional regulatory sequence which is responsive to said
predetermined transcription factor and a reporter cassette encoding a sequence that confers a signal
or detectable phenotype. The TRX components of the LBD-TRX protein and the CR-TRX protein
should be functionally complementary; i.e., if the TRX of one fusion is a DBD, the TRX of the other
fusion is an AD, and vice versa. Prior to contacting of the system with an agent which is an agonist
ligand of the LBD, the transcriptional activity of the LBD-TRX and CR-TRX with respect to the reporter
polynucleotide is substantially absent and the system substantially lacks functional expression of an
encoded sequence of the reporter polynucleotide. Subsequent to contacting of the system with an
agent which produces an activated conformation of the LBD in the LBD-TRX fusion protein,
the CR-TRX fusion protein functionally associates, forming an LBD-TRX/CR-TRX complex that is
transcriptionally active with respect to the reporter polynucleotide, whereupon the reporter cassette
can be transcribed and functionally expressed under the control of the transcriptional regulatory
sequence. Thus, functional expression of the reporter cassette serves to report whether the system
has been contacted with an agent which induced an activated conformation of the LBD.
Advantageously, in one variation, the reporter cassette encodes a cell surface reporter protein which
can be detected and/or selected for on the basis of its presence on the surface of a cell membrane. In
embodiments where the nuclear receptor signal transduction system consists of metabolically active
cells having a polynucleotide sequence encoding the LBD-TRX fusion, the CR-TRX fusion, and a
reporter polynucleotide sequence, the system is referred to as a CR-TRX reporter cell.

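A nuclear receptor signal transduction system of the present invention can be used to
evaluate one or more test agents for their ability to activate the LBD of a predetermined nuclear
receptor and induce corepressor binding. Typically, the ability to activate the LBD in this manner
indicates an antagonistic or partial (mixed) antagonistic activity as referenced to a physiological ligand
of the nuclear receptor. In this aspect of the invention, a test agent is applied to a nuclear receptor
signal transduction system and incubated for a suitable incubation period anticipated to be
substantially sufficient for the agent to transduce a signal to the LBD. After the incubation period, the
expression of the reporter cassette is determined by detecting the presence of the reporter. Test
agents which produce a statistically significant increase in reporter as compared to background
(e.g., placebo) are scored as receptor antagonists or partial (mixed) antagonists. In embodiments where
the nuclear receptor signal transduction system comprises a metabolically active intact reporter cell,
typically a population of reporter cells is contacted with a test agent, and the ability of the test agent to
function as a receptor antagonist is determined by the relative expression of detectable reporter above
background as compared to a reference population of reporter cells under substantially equivalent
conditions in the absence of said test agent. Dose-response data can be generated in this manner using
a variety of different concentrations of the test agent.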

In a variation, the method may be used to identify agonists or partial agonists
of a predetermined nuclear receptor LBD by determining the ability of a test agent to produce a
statistically significant reduction in the expression of the reporter by inhibiting the capacity of a unit dose
of a predetermined antagonist ligand of the LBD to produce the amount of expression of the
reporter as occurs in the absence of the test agent.

Reverse Hybrid System: Corepressor

In a variation, the invention provides a nuclear receptor signal transduction system,
comprising a "reverse hybrid" reporter host cell containing: (1) an LBD-TRX polynucleotide sequence
encoding a fusion protein comprising a ligand-binding domain of a nuclear receptor in polypeptide
linkage to a DBD or AD of a transcription factor; (2) a CR-TRX polynucleotide sequence encoding and
expressing a corepressor fusion protein (referred to herein as "CR-TRX") which comprises a domain
of a nuclear receptor corepressor protein ("CR") capable of binding to said LBD of said predetermined
nuclear receptor linked, typically in polypeptide linkage, to a transcriptional activator domain or
alternatively to a DNA-binding domain of a predetermined transcription factor (TRX); (3) a relay
(or signal inverter) gene encoding a protein which is efficiently expressed as a consequence of the
LBD-TRX binding to the CR-TRX as a transcriptionally active complex; and (4) a reporter gene
comprising, in linear order, a transcriptional regulatory sequence responsive to said transcriptionally
active complex, and a reporter cassette encoding a reporter, wherein the reporter gene is efficiently
expressed when the product of the relay (or signal inverter) gene is substantially absent and is either
poorly expressed or not expressed when the relay (or signal inverter) gene is efficiently expressed. In
an aspect, the LBD is a functional portion of a steroid hormone superfamily receptor capable of
binding a corepressor protein in the absence of activating ligand and capable of undergoing a
stimulus-induced conformational change, such as by binding a known agonistic ligand, so as to
reduce or abrogate binding to the corepressor. The reverse hybrid system facilitates identification of
test agents that relieve corepressor binding to the LBD.
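
The signal-inversion logic of the reverse hybrid system can be sketched as two chained Boolean steps: the LBD/corepressor complex drives the relay gene, and the relay product shuts off the reporter. The function names are hypothetical, and the on/off treatment is a simplification of what would be graded expression levels in a real cell.

```python
def relay_expressed(complex_formed: bool) -> bool:
    """The relay (signal inverter) gene is expressed when the
    LBD-TRX/CR-TRX complex forms and is transcriptionally active."""
    return complex_formed


def reporter_expressed(relay_on: bool) -> bool:
    """The reporter is expressed only when the relay product is absent."""
    return not relay_on


def reverse_hybrid_readout(agent_relieves_corepressor: bool) -> bool:
    """Positive reporter readout for agents that relieve corepressor binding."""
    complex_formed = not agent_relieves_corepressor
    return reporter_expressed(relay_expressed(complex_formed))


if __name__ == "__main__":
    print(reverse_hybrid_readout(agent_relieves_corepressor=True))
    print(reverse_hybrid_readout(agent_relieves_corepressor=False))
```

The inversion is what lets a loss of binding, which would otherwise appear as loss of signal, be read out as a gain of reporter expression.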

A reverse hybrid nuclear receptor signal transduction system of the present invention
can be used to evaluate one or more test agents for their ability to activate the LBD of a
predetermined nuclear receptor by relieving corepressor binding. Typically, the ability to activate the
LBD indicates an agonistic activity as referenced to the physiological ligand of the nuclear receptor;
however, given the pleiotropic nature of many nuclear receptor ligands, it is possible that some such
agents will be agonistic with respect to certain effects and antagonistic with respect to others. In this
aspect of the invention, a test agent is applied to a reverse hybrid nuclear receptor signal transduction
system and incubated for a suitable incubation period anticipated to be substantially sufficient for the
agent to transduce a signal to the LBD. After the incubation period, the expression of the reporter
cassette is determined by detecting the presence of the reporter. Test agents which produce a
statistically significant increase in reporter as compared to background (e.g., placebo) are scored as
receptor agonists. In embodiments where the reverse hybrid nuclear receptor signal transduction
system comprises a metabolically active intact reporter cell, typically a population of reporter cells is
contacted with a test agent, and the ability of the test agent to function as a receptor agonist is
determined by the relative expression of detectable reporter above background as compared to
a reference population of reporter cells under substantially equivalent conditions in the absence of
said test agent. Dose-response data can be generated in this manner using a variety of different
concentrations of the test agent.

In a variation, the method may be used to identify antagonists of a predetermined
nuclear receptor LBD by determining the ability of a test agent to produce a statistically significant
reduction in the expression of the reporter by inhibiting the capacity of a unit dose of a predetermined
activating ligand of the LBD to produce the amount of expression of the reporter as occurs in the
absence of the test agent.

Reverse Hybrid System: Coactivators

In a variation of the reverse hybrid system, a coactivator (CA) domain may be used
instead of a corepressor (CR) domain, and the system can be used, for example, to identify agents
that inhibit, abrogate, disrupt, diminish, interfere with, or otherwise antagonize the functional
interaction of the LBD and the coactivator domain. Such agents need not necessarily interact with the
ligand-binding pocket of the LBD and, for example, may interact with a binding interface between the
coactivator and the LBD, or at other sites. The method may be used to identify functional antagonists
of a predetermined nuclear receptor LBD by determining the ability of a test agent to produce a
statistically significant reduction or relative inhibition in the expression of the reporter by inhibiting the
capacity of a unit dose of a predetermined activating ligand of the LBD to produce the amount of
expression of the reporter as occurs in the absence of the test agent.

The invention can also be modified to identify agents that modulate, either positively
or negatively, binding interactions between an LBD and a coactivator or corepressor without said
agents necessarily interacting with a bona fide ligand-binding domain of the LBD. Generally these
formats employ an LBD, or binding portion thereof, which is functionally constitutive (i.e., not
ligand-dependent) for binding to a coactivator or corepressor, or a binding fragment thereof. Such
constitutive LBD species can be found as natural variants, can be generated by mutagenesis and
selection for constitutive function, or can be generated by employing a high-affinity bona fide ligand at
high concentration or covalently attached to the ligand binding pocket (e.g., via photoaffinity labelling)
or other means known to those skilled in the art. In such systems, the constitutive LBD and
coactivator (or corepressor) are used in LBD-TRX and CA-TRX (or CR-TRX) fusions in a reverse
hybrid system, and agents are screened for their ability to inhibit the constitutive LBD:CA (or LBD:CR)
interaction as measured by a positive readout of the reporter gene.



Direct Interaction Methods



In a variation of the invention, direct physical interaction, measured as binding
between an LBD domain and a CA domain (or CR domain) as a consequence of ligand presence, can
be determined. In an aspect, an LBD domain is immobilized on a capture surface and a soluble,
labelled or epitope-tagged CA or CR domain is introduced under aqueous physiological conditions,
either in the absence or presence of a known ligand or a test agent. A typical format of the direct
method can be an ELISA, for illustration. Agents which produce a ligand-induced binding between the
immobilized LBD and the soluble, labelled or epitope-tagged CA (or CR), thereby resulting in the CA
(or CR) becoming immobilized on the capture surface, retained following washing of the surface
with a rinse solution substantially lacking soluble, labelled or epitope-tagged CA (or CR), and
detected on the capture surface by suitable means, can be identified as candidate
ligands.


In a variation, the CA (or CR) can be immobilized on the capture surface and the LBD
can be labelled or epitope-tagged. Epitope-tagged proteins can generally be detected by
immunochemical methods using at least one antibody species that is specifically reactive with the
epitope.


In a variation, the LBD species (or a multiplicity thereof) is immobilized on the capture
surface and the soluble, labelled CA and/or CR species (or a multiplicity of species thereof) can be
used to identify ligand-induced binding interactions or ligand-dependent relief of binding interactions
between an LBD and a CA or CR (or multiple combinations thereof). In such variations, it is usually
preferable to employ distinctive labels or epitope-tags for each species of CA and/or CR, which can
provide a basis for discrimination of which species of CA or CR bind to an LBD species (or a
collection of LBD species) based upon unique detection of each label or tag on the capture surface.
Vice versa, a CA or CR (or multiple species thereof) can be immobilized on the capture surface and
multiple species of uniquely labeled or tagged LBDs may be used. In each case, a test agent can be
evaluated for its ability to produce a concentration-dependent binding between LBD and CA or CR
species and compared to a parallel reaction lacking agent and/or to a parallel reaction lacking agent
and containing a known ligand, either agonist or antagonist. Variations which employ multiple species
of LBD are termed "LBD multiplexed" and variations which employ multiple species of CA or CR are
termed "CA multiplexed" or "CR multiplexed", respectively. Test agents are categorized based on
their effect as an agonist, antagonist, or lack of such effect with respect to each combination of LBD
and CA or CR. Concentration dependence and EC50 or IC50 values can be additional parameters for
categorization of a test agent as an agonist or antagonist with regard to a particular end point.
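
To illustrate how an EC50 value might be derived from concentration-dependent binding data, the sketch below interpolates on a log-concentration scale between the two measurements that bracket the half-maximal response. The interpolation approach, the function name, and the data are assumptions for illustration; in practice a sigmoidal curve fit is commonly used instead, and the text does not specify a method.

```python
import math


def ec50_by_interpolation(concs, responses):
    """Estimate the concentration giving a half-maximal response.

    Assumes `concs` is a list in ascending order (all values > 0) and
    `responses` is a list that is non-decreasing with concentration.
    Interpolates linearly in log10(concentration) between the two
    points that bracket the half-maximal response.
    """
    low, high = min(responses), max(responses)
    half = low + (high - low) / 2.0
    points = list(zip(concs, responses))
    for (c0, r0), (c1, r1) in zip(points, points[1:]):
        if r0 <= half <= r1 and r1 > r0:
            frac = (half - r0) / (r1 - r0)
            log_c = math.log10(c0) + frac * (math.log10(c1) - math.log10(c0))
            return 10 ** log_c
    raise ValueError("half-maximal response is not bracketed by the data")


if __name__ == "__main__":
    # Hypothetical molar concentrations and percent-of-maximum binding.
    concs = [1e-9, 1e-8, 1e-7, 1e-6]
    binding = [0.0, 20.0, 80.0, 100.0]
    print(ec50_by_interpolation(concs, binding))
```

For an inhibition curve, the same function can be applied to the inhibition values (for example, 100 minus percent binding) to obtain an IC50 estimate.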

Multiplexing of Formats 

The invention employs several formats, such as those just described. These formats
can be employed in parallel to provide a multiplexed format assay. A multiplexed format assay of the
invention comprises at least two of the following: (1) positive hybrid system:coactivator, (2) positive
hybrid system:corepressor, (3) reverse hybrid system:coactivator, (4) reverse hybrid
system:corepressor, (5) direct interaction assay, or (6) other art-known assay for identifying and/or
quantifying ligand efficacy and/or potency as an antagonist or agonist of nuclear receptors. A
preferred multiplex assay is a combination of (1) and (5). Additionally, an individual type of format
can be multiplexed with regard to LBD species employed and/or CA or CR species employed (e.g., a
positive hybrid system with an estrogen receptor LBD and a second positive hybrid system with a
thyroid hormone receptor LBD). Combinations of format-multiplexed and LBD-multiplexed,
CA-multiplexed, and/or CR-multiplexed systems and methods can be used in an individual method
embodiment to enhance sensitivity and/or selectivity of identification of agonists and antagonists of
nuclear receptors.

In embodiments where cell-based assays are multiplexed, it is often preferred to
employ multiple distinct species of reporter cells in each assay chamber (e.g., well of a cell culture
plate), with each species of reporter cell providing a distinguishable and detectable readout. In a
variation using cell-based assays, flow cytometry (which can include capillary cytometry) is used to
detect readout signals, often with the use of diode lasers or tunable illumination sources in cases
where the readout involves fluorescent signals.

Specificity of Bioeffect Fingerprinting

In an aspect, the invention provides a method for identifying a candidate
pharmaceutical agent from a library of test agents, wherein the candidate pharmaceutical agent has a
desired biological effect profile ("bioeffect fingerprint"). The method comprises: (1) performing n
(where n is a number greater than 2, preferably greater than 3, and less than 10 billion) distinct assays
of the invention individually using each discrete test agent (which may be a mixture) of the library so
as to obtain for each individual test agent measurements of at least n biological effects, each detected as
a ligand-induced conformational change or a binding interaction change in a discrete assay, (2) for
each biological effect detected, assigning a score value (binary or quantitative) based upon the
detection (or lack thereof) of a ligand-induced conformational change or binding interaction in each
assay, separately, to generate a score matrix ("bioeffect fingerprint") for each agent, and (3)
comparing each agent's score matrix to an equivalent (i.e., representing the same assays under similar
conditions) score matrix for one or more predetermined agonist(s) and/or antagonist(s), and thereby
identifying agents having score matrices substantially similar to the score matrix(ces) of said
predetermined agonist(s) or antagonist(s). The substantial identity determination may be made on the
basis of the general knowledge and experience of a skilled practitioner in the art, and/or may be made on
the basis of a rank-ordering of matrix similarity (i.e., total number of matches), and/or may be made by
electronic computation using a trained neural network implementation (e.g., BrainMaker running on a
Windows platform trained with data from a plurality of predetermined agonists and/or antagonists in
similar assays), or by other means. This method provides identification of novel receptor ligands and
their characterization based on bioeffect measurements made using in vitro assays and/or cell culture
assays. In general, the predictive value for biological effects in vivo of such a multi-parameter method
increases with score matrix size (i.e., score matrices having at least 5 cells or elements, each
representing a distinct assay type, are preferred).
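By way of illustration only, the rank-ordering of matrix similarity by total number of matches can be sketched as follows. This is a minimal sketch assuming binary score values and a single reference agonist; the function names, agent names, and score values are hypothetical and are not taken from the specification.

```python
from typing import Dict, List, Tuple


def match_count(fingerprint: List[int], reference: List[int]) -> int:
    """Count the assays in which two binary fingerprints carry the same score."""
    if len(fingerprint) != len(reference):
        raise ValueError("fingerprints must cover the same set of assays")
    return sum(1 for x, y in zip(fingerprint, reference) if x == y)


def rank_by_similarity(
    reference: List[int], library: Dict[str, List[int]]
) -> List[Tuple[str, int]]:
    """Rank-order test agents by total number of matches to the reference."""
    scored = [(name, match_count(fp, reference)) for name, fp in library.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical score matrices: one row per agent, one column per distinct assay
# (five assays, the preferred minimum size stated in the text).
reference_agonist = [1, 1, 0, 1, 0]
library = {
    "agent_A": [1, 1, 0, 1, 0],
    "agent_B": [1, 0, 0, 1, 1],
    "agent_C": [0, 0, 1, 0, 1],
}

ranking = rank_by_similarity(reference_agonist, library)
for name, matches in ranking:
    print(name, matches)
```

Agents at the top of such a ranking would be the candidates whose score matrices are most similar to that of the predetermined agonist; a quantitative score or a trained classifier could be substituted for the simple match count.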

In an aspect, the invention also provides novel polypeptides comprising a binding
amino acid sequence that is: (1) non-naturally occurring in a nuclear protein, and (2) predetermined to
bind to a nuclear receptor, typically by interaction with a binding interface of a nuclear receptor at
which naturally-occurring coactivators and/or corepressors bind. These polypeptides and
compositions thereof are candidate antagonists for coactivator or corepressor binding (e.g.,
competitive inhibitors), can be employed as a CA portion of a CA-TRX fusion or as a CR portion of a
CR-TRX fusion for use in assays of the invention, can be sold commercially as reagents,
and have other uses apparent to those skilled in the art. Such binding amino acid sequences generally
comprise an LXXLL motif (wherein L is leucine and X is any of the conventional amino acids) and
include, with regard to the LBD of the human estrogen receptor, the sequences shown in Fig. 16.
The novel polypeptides are generally from 5 to 50,000 amino acids long, most usually from 10 to 500
amino acids long, and may comprise one or multiple species of such binding amino acid sequences,
which may be repeated, and the remainder of the polypeptide may contain other naturally occurring
sequences (e.g., fusions to known proteins) and/or may contain random, pseudorandom, or defined
sequence kernel amino acid sequence(s). Peptidomimetics having structural homology to the novel
binding sequences are also provided according to methods known in the art; such peptidomimetics or
constrained peptides can provide novel therapeutic drugs to provide agonism or antagonism of one or
more nuclear receptor ligands.
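As an illustrative aid, the LXXLL motif described above (L is leucine, X is any of the twenty conventional amino acids) can be located in a one-letter-code peptide sequence with a regular expression. This is a minimal sketch; the function name and the example peptide are hypothetical and do not correspond to any sequence of Fig. 16.

```python
import re

# The twenty conventional amino acids in one-letter code.
CONVENTIONAL = "ACDEFGHIKLMNPQRSTVWY"

# A lookahead is used so that overlapping motifs are all reported.
LXXLL = re.compile(r"(?=(L[%s]{2}LL))" % CONVENTIONAL)


def find_lxxll(sequence: str):
    """Return (1-based start position, motif) for every LXXLL motif found."""
    sequence = sequence.upper()
    return [(m.start() + 1, m.group(1)) for m in LXXLL.finditer(sequence)]


# Hypothetical peptide, written amino-terminus to carboxy-terminus.
peptide = "AAGSLTELLKGGHLLALLPQ"
motifs = find_lxxll(peptide)
print(motifs)
```

Positions are reported from the amino terminus, consistent with the left-to-right polypeptide notation used herein.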


In an aspect, the nuclear receptor signal transduction system may comprise a
transgenic nonhuman animal having all or some somatic cells which are reporter cells. In this aspect,
the transgenic nonhuman animal may be administered a test agent which may act indirectly (i.e., other
than by binding to the LBD) or as an LBD ligand. Detection of reporter expression at a statistically
significant level in said one or more somatic cell types indicates agonist activity of the test agent. In an
embodiment, lymphocytes of said transgenic animal are reporter cells and are collected from said
transgenic animal following administration of a test agent to the transgenic animal and a suitable
incubation period; the relative abundance of lymphocytes expressing the reporter protein, as
referenced to lymphocytes from an untreated transgenic animal under similar conditions except
lacking administration of the test agent, serves to identify the agonistic or antagonistic efficacy, if any,
of the test compound for activation of the predetermined

In a variation, bioeffect fingerprinting allows profiling of a plurality of compounds (test
agents), said profiling comprising contacting a compound (test agent) with a plurality of reporter cell
species and/or non-cellular reporter assays in a plurality of individual discrete reaction vessels, with
each discrete reaction vessel containing a unique test agent species, typically at a predetermined
concentration, and determining, by comparison to a parallel set of assays employing as test agents
compounds having predetermined pharmaceutical activities, a bioeffect fingerprint for each test agent,
comprising a tabulated data compilation, often in matrix format and stored electronically in an
electronic computer, indicating the assay readouts for said test agent and the predetermined
pharmaceutical activities most closely similar, based on statistical analysis, to a test agent, and
thereby categorizing test agents based upon similarities to known pharmaceutically active compounds.
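The categorizing step of this variation can be sketched, for illustration only, as a nearest-reference lookup over tabulated assay readouts. The sketch assumes quantitative readouts and uses Euclidean distance as a stand-in for the unspecified statistical analysis; all names and values are hypothetical.

```python
import math
from typing import Dict, List


def euclidean(a: List[float], b: List[float]) -> float:
    """Euclidean distance between two readout vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("readouts must cover the same assays")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def categorize(test_readouts: Dict[str, List[float]], reference_panel: Dict[str, dict]):
    """Tabulate, for each test agent, its readouts and the closest reference."""
    table = {}
    for agent, readout in test_readouts.items():
        closest = min(
            reference_panel,
            key=lambda ref: euclidean(readout, reference_panel[ref]["readout"]),
        )
        table[agent] = {
            "readout": readout,
            "closest_reference": closest,
            "activity": reference_panel[closest]["activity"],
        }
    return table


# Hypothetical parallel assays on compounds of predetermined activity.
reference_panel = {
    "ref_agonist": {"readout": [0.9, 0.8, 0.1], "activity": "agonist"},
    "ref_antagonist": {"readout": [0.1, 0.2, 0.9], "activity": "antagonist"},
}
# Hypothetical readouts for two test agents in the same three assays.
test_readouts = {"agent_1": [0.8, 0.7, 0.2], "agent_2": [0.2, 0.1, 0.8]}

fingerprints = categorize(test_readouts, reference_panel)
for agent, row in fingerprints.items():
    print(agent, row["closest_reference"], row["activity"])
```

The resulting dictionary plays the role of the tabulated data compilation; another distance or a correlation measure could be substituted where the readouts are on different scales.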

Orphan Receptors

The methods and compositions of the present invention can be advantageously used
to identify ligands to orphan receptors for which physiological ligands are unknown. Two-hybrid
systems can be employed to identify polypeptide sequences having binding activity with regard to the
orphan receptor, and/or peptidomimetics can be employed to identify binding motifs which can mimic
physiological coactivators and corepressors, even though such physiological coactivators and
corepressors might remain unidentified.

Such compositions, reporter host cells, transgenic nonhuman animals, and kits find
use as commercially marketable drug development reagents, as sensitive diagnostic detection systems
for environmental ligands (e.g., to assay polycyclic aromatic hydrocarbons in a sample by detecting
their agonistic action on the Ah nuclear receptor), and as reagents to titrate biopotency of a nuclear
receptor ligand for preparation of pharmaceutical formulations.


Other features and advantages of the invention will be apparent from the following
description of the drawings, preferred embodiments of the invention, the examples, and the claims.

BRIEF DESCRIPTION OF THE FIGURES

Fig. 1 is a schematic representation of an exemplary embodiment of a direct
interaction assay format (ELISA) for measuring interaction of a nuclear receptor LBD with a binding
fragment of steroid receptor coactivator-1 (SRC-1). The LBD of the nuclear receptor (ER = estrogen
receptor, TRα/β = thyroid hormone receptor α or β form) is fused in-frame to a glutathione S-transferase
sequence (GST) to form the binding member for the nuclear receptor LBD, which is immobilized on a
substrate indicated as a well surface, such as by electrostatic binding to a plastic 96-well plate. The
coactivator member contains 3 leucine charged domains (LCDs) of SRC-1 fused in-frame to GST/MBP
at the amino-terminus and fused in-frame to an antibody 179 epitope tag at the carboxy-terminus.
The LCD sequences are shown. The SRC-1 fusion member that is bound to the immobilized LBD is
detected with a mouse antibody that specifically reacts with the 179 epitope and a second antibody
(alkaline phosphatase conjugated anti-mouse Ab).


Fig. 2 shows the results of the direct interaction assay using ER LBD interacting with
SRC-1. 0.5 µg of purified GST-ER was used with varying amounts of crude bacterial lysate containing
GST-SRC-1. The interaction is specific (GST control) and enhanced by estradiol.

Fig. 3 shows the results of the direct interaction assay using ER LBD interacting with
SRC-1. 0.5 µg of purified GST-ER was used with varying amounts of crude bacterial lysate containing
GST-SRC-1. The interaction is specific (GST control) and enhanced by estradiol.

Fig. 4 shows results of a direct interaction assay of the TRα and TRβ LBDs interacting
with SRC-1. The indicated amounts of GST fusion proteins were used. The interaction is specific and
dependent on T3.

Fig. 5 shows results of a direct interaction assay as a dose-response curve of T3 in
promoting interaction between the TRα and TRβ LBDs and SRC-1. T3 appears more potent on the
receptor.

Fig. 6 shows a schematic portrayal of LBD-TRX, CA-TRX, and CR-TRX constructs
used and the reporter polynucleotide of a two-hybrid system. "Receptor" indicates LBD-TRX, with the
LBD exemplified by a generic nuclear receptor (NR) LBD with bound ligand (L), and the TRX domain
represented by the VP16 acidic activation domain (VP16). "Coactivator" indicates a CA-TRX, with
exemplary CA species being a portion of the SRC-1 protein, a portion of the CBP protein, or isolated
LCD sequences from those proteins; the TRX domain is exemplified as the Gal4 DNA binding domain
(DBD). "Corepressor" indicates a CR-TRX, with exemplary CR species being the interacting domains
(ID1 or ID2) from SMRT or NcoR fused to the TRX domain as exemplified by Gal4 DNA binding
domain (DBD). The reporter polynucleotide is exemplified as a UAS-TK-luciferase reporter containing
binding sites for Gal4 DBD. CHO cells were triple transfected with these constructs and hormone
added.

Fig. 7 shows data from a positive two-hybrid system as a dose-response curve
showing that reporter (luciferase) is generated only when all three components (VP16ER, Gal4-SRC-1,
and UAS-TK-luciferase) are transfected together, and that the response is dose-dependent with
respect to estradiol concentration.

Fig. 8 shows data from a positive two-hybrid system as a dose-response curve
showing that reporter (luciferase) is generated only when all three components (VP16TR, Gal4-SRC-1,
and UAS-TK-luciferase) are transfected together, and that the response is dose-dependent with
respect to T3 concentration.

Fig. 9 shows coactivator/corepressor interactions with TR LBD in the positive two-
hybrid system. NcoR ID1, and to a lesser extent SMRT ID1, interact with TR LBD, but neither ID2
interacts strongly with the LBD. T3 induced dissociation of the corepressor fragments at significantly
lower concentrations than are required to recruit a coactivator.

Fig. 10 shows coactivator/corepressor interactions with ER LBD in the positive two-
hybrid system. NcoR ID1, and to a lesser extent SMRT ID1, interact with ER LBD, but neither ID2
interacts strongly with the LBD. None of the four IDs interact significantly with the ER LBD in the
absence of ligand.






Fig. 11 shows a schematic portrayal of a method for identifying novel LCD sequences
from libraries of displayed peptides. Peptide libraries were constructed by fusing sequences to the
Lac repressor (LacI). The encoding plasmid contains operator sites so that a bacterial lysate contains
the displayed peptides bound to the encoding plasmid. The library was screened with immobilized
GST-ER in the presence of estradiol in four rounds of enrichment.

Fig. 12 shows ELISA signals for LacI-fused peptides obtained from either a focused
(-LXXLL-) or random (-XXXXX-) 15mer library in the presence of ER and estradiol.

Fig. 13 shows the sequences of LacI-fused peptides obtained by screening the library
by ER panning. The top set of sequences was obtained from the focused (-LXXLL-) library and the
bottom set was obtained from the random sequence (-XXXXX-) library. Notable amino acids are
highlighted.

Fig. 14 shows ELISA data from the fourth round selectants of the LacI-fusions
(tetravalent reagent) which were subcloned as a population into an MBP vector (monovalent reagent)
and random clones tested without estradiol. Control is SRC-1 fragment (597-781) containing the 3
LCDs.






Fig. 15 shows ELISA data from the fourth round selectants of the LacI-fusions
(tetravalent reagent) which were subcloned as a population into an MBP vector (monovalent reagent)
and random clones tested with estradiol. Control is SRC-1 fragment (597-781) containing the 3 LCDs.
Note the estradiol dependence of many of the clones for interaction with ER (compare data in Fig. 14
with Fig. 15).

Fig. 16 shows the sequences of the selected clones. Notable amino acid residues are
highlighted.

Fig. 17 shows a schematic hypothetical example of bioeffect fingerprinting using a
two-dimensional score matrix.

DETAILED DESCRIPTION

All publications cited herein are incorporated herein in their entirety as if reproduced
verbatim.
DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein have the
same meaning as commonly understood by one of ordinary skill in the art to which this invention
belongs. Although any methods and materials similar or equivalent to those described herein can be
used in the practice or testing of the present invention, the preferred methods and materials are
described. For purposes of the present invention, the following terms are defined below.


As used herein, the term "coactivator" is any limiting factor that enhances the
transcriptional activity of a nuclear receptor without substantially altering basal transcription with
respect to at least one gene that can be transcriptionally regulated by the nuclear receptor in a ligand-
dependent manner, and which binds to the nuclear receptor via direct protein-protein contact(s),
typically also in a ligand-dependent or AF-2 domain-dependent manner. Examples of some
coactivators include, but are not limited to, SRC-1, ERAP140, RIP140, RIP160, Trip1, SWI1/SNF,
ARA70, RAP46, TIF1, CBP, p300, TIF2, GRIP1, TRAP complex proteins, and other such proteins as
are known in the art, including those described in Glass et al. (1997) Curr. Opin. Cell Biol. 9: 222 or
Horwitz et al. (1996) Mol. Endocrinol. 10: 1167. Coactivators typically contain at least one -LXXLL- motif
(LCD), typically at least two or three such motifs. Coactivators typically bind to LBDs in an AF-2
dependent manner. Coactivators can include amino acid sequences which are not found in nature but
which are identified by a peptide screening method as binding to an LBD of a nuclear receptor in a
ligand-dependent manner.

As used herein, the term "corepressor" is any limiting proteinaceous factor that inhibits
transcription after being tethered to a promoter by DNA-bound receptors or which blocks binding of a
coactivator; factors whose binding to receptors is ligand-regulated; and factors whose inhibitory effect
on transcription can be relieved in a ligand-dependent manner. Examples of some corepressors
include, but are not limited to, N-CoR, SMRT, calreticulin, TRUP, and Tup1, and other such proteins as
are known in the art, including those described in Glass et al. (1997) Curr. Opin. Cell Biol. 9: 222 or
Horwitz et al. (1996) Mol. Endocrinol. 10: 1167. Corepressors typically contain at least one or more
interaction domains (IDs), typically at least two or three such motifs.

As used herein, the twenty conventional amino acids and their abbreviations follow
conventional usage (Immunology - A Synthesis, 2nd Edition, E.S. Golub and D.R. Gren, Eds., Sinauer
Associates, Sunderland, Massachusetts (1991), which is incorporated herein by reference).
Stereoisomers (e.g., D-amino acids) of the twenty conventional amino acids, unnatural amino acids
such as α,α-disubstituted amino acids, N-alkyl amino acids, lactic acid, and other unconventional amino
acids may also be suitable components for polypeptides of the present invention. Examples of
unconventional amino acids include: 4-hydroxyproline, γ-carboxyglutamate, ε-N,N,N-trimethyllysine,
ε-N-acetyllysine, O-phosphoserine, N-acetylserine, N-formylmethionine, 3-methylhistidine,
5-hydroxylysine, ω-N-methylarginine, and other similar amino acids and imino acids (e.g.,
4-hydroxyproline). In the polypeptide notation used herein, the lefthand direction is the amino terminal
direction and the righthand direction is the carboxy-terminal direction, in accordance with standard
usage and convention. Similarly, unless specified otherwise, the lefthand end of single-stranded
polynucleotide sequences is the 5' end; the lefthand direction of double-stranded polynucleotide
sequences is referred to as the 5' direction. The direction of 5' to 3' addition of nascent RNA
transcripts is referred to as the transcription direction; sequence regions on the DNA strand having the
same sequence as the RNA and which are 5' to the 5' end of the RNA transcript are referred to as
"upstream sequences"; sequence regions on the DNA strand having the same sequence as the RNA
and which are 3' to the 3' end of the RNA transcript are referred to as "downstream sequences".

As applied to polypeptides, the term "substantial identity" means that two peptide
sequences, when optimally aligned, such as by the programs GAP or BESTFIT using default gap
weights, share at least 80 percent sequence identity, preferably at least 90 percent sequence identity,
more preferably at least 95 percent sequence identity or more (e.g., 99 percent sequence identity).
Preferably, residue positions which are not identical differ by conservative amino acid substitutions.
Conservative amino acid substitutions refer to the interchangeability of residues having similar side
chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine,
leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and
threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a
group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group
of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids
having sulfur-containing side chains is cysteine and methionine. Preferred conservative amino acid
substitution groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-
valine, and asparagine-glutamine.
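For illustration, the percent-identity threshold and the preferred conservative substitution groups can be checked on two sequences that have already been optimally aligned. This is a minimal sketch: it does not perform the GAP or BESTFIT alignment itself, it assumes gaps are written as "-" and that gapped columns are excluded from the identity count, and the function names and example sequences are hypothetical.

```python
# Preferred conservative substitution groups named in the text, one-letter code.
PREFERRED_GROUPS = [set("VLI"), set("FY"), set("KR"), set("AV"), set("NQ")]


def aligned_columns(seq1: str, seq2: str):
    """Pair up residues of two pre-aligned sequences, skipping gapped columns."""
    if len(seq1) != len(seq2):
        raise ValueError("aligned sequences must have equal length")
    return [(a, b) for a, b in zip(seq1, seq2) if a != "-" and b != "-"]


def percent_identity(seq1: str, seq2: str) -> float:
    """Percent of ungapped aligned columns carrying identical residues."""
    columns = aligned_columns(seq1, seq2)
    if not columns:
        return 0.0
    identical = sum(1 for a, b in columns if a == b)
    return 100.0 * identical / len(columns)


def is_conservative(a: str, b: str) -> bool:
    """True if two different residues share a preferred substitution group."""
    return a != b and any(a in g and b in g for g in PREFERRED_GROUPS)


def substantially_identical(seq1: str, seq2: str, threshold: float = 80.0) -> bool:
    """Apply the at-least-80-percent criterion to pre-aligned sequences."""
    return percent_identity(seq1, seq2) >= threshold


# Hypothetical pre-aligned peptides differing by one leucine/isoleucine change.
first = "LKELLRAVNQ"
second = "LKEILRAVNQ"
identity = percent_identity(first, second)
mismatches = [(a, b) for a, b in aligned_columns(first, second) if a != b]
print(identity, mismatches, all(is_conservative(a, b) for a, b in mismatches))
```

Raising `threshold` to 90 or 95 applies the more preferred identity levels stated above.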

The term "fragment" as used herein refers to a polypeptide that has an amino-
terminal and/or carboxy-terminal deletion, but where the remaining amino acid sequence is identical to
the corresponding positions in the naturally-occurring (e.g., mature protein) sequence deduced, for
example, from a full-length cDNA sequence of a coactivator, corepressor, or nuclear receptor.
Fragments typically are at least 14 amino acids long, preferably at least 20 amino acids long, usually
at least 50 amino acids long or longer, and contain interaction interfaces such as LCDs, IDs, or AF-2.

The term "analog" as used herein refers to polypeptides which are comprised of a
segment of at least 25 amino acids that has substantial identity to a portion of an LBD, CA, or CR, and
which has specific binding to a nuclear receptor, CA, or CR species. Typically, analog polypeptides
comprise a conservative amino acid substitution (or addition or deletion) with respect to the naturally-
occurring sequence. Analogs typically are at least 20 amino acids long, preferably at least 50 amino
acids long or longer, most usually being as long as full-length naturally-occurring polypeptide.

The term "polypeptide" is used herein as a generic term to refer to native protein,
fragments, or analogs of a polypeptide sequence. Hence, native protein, fragments, and analogs are
species of the polypeptide genus.

The term "naturally-occurring" as used herein as applied to an object refers to the fact
that an object can be found in nature. For example, a polypeptide or polynucleotide sequence that is
present in an organism (including viruses) that can be isolated from a source in nature and which has
not been intentionally modified by man in the laboratory is naturally-occurring. As used herein,
laboratory strains of rodents which may have been selectively bred according to classical genetics are
considered naturally-occurring animals.

As used herein, the term "LBD" or "ligand-binding domain" refers to the protein
domain of a nuclear receptor, such as a steroid superfamily receptor or other suitable nuclear receptor
as discussed herein, which binds a physiological ligand (e.g., a steroid hormone) and thereupon
undergoes a conformational change and/or altered intermolecular interaction with an associated
protein so as to confer a detectable activity upon a second, linked functional domain. When a suitable
efficacious ligand interacts with the LBD and forms a liganded complex with an agonist, the LBD is
activated and one or more coactivator species become bound. When a suitable efficacious ligand
interacts with the LBD and forms a liganded complex with an antagonist, or in some cases in the
absence of bound ligand, the LBD binds to one or more corepressor species.
An agonist ligand is defined herein as a ligand which, when bound by the LBD,
induces a conformational change and/or alteration of an intramolecular or intermolecular interaction of
the LBD so as to induce binding of coactivator and/or dissociation of corepressor. An antagonist
ligand is defined herein as a ligand which opposes the effect of an agonist ligand, generally by dose-
dependent competitive binding, or by otherwise inducing or stabilizing binding of the LBD to a
corepressor or inhibiting LBD binding to one or more coactivator species. Nuclear receptor LBDs are
known to and/or can be identified by those skilled in the art and as described herein. Exemplary LBDs
are obtained from steroid hormone receptors (e.g., ER, GR, PR, AR, EcR), retinoic acid receptors
(e.g., RAR, RXR, RZR), peroxisome proliferator activator receptors (e.g., PPARs), thyroxine (T3)
receptor (TR), vitamin D receptor (e.g., VDR), farnesoid receptor (e.g., FXR), aryl hydrocarbon
receptor (Ah), and orphan nuclear receptors, among others. The invention is not limited to known nuclear
receptors, but can employ LBDs from any suitable nuclear receptor having an LBD which, in the
absence of bound ligand, binds corepressor and/or fails to bind coactivator species.

As used herein, the terms "LBD-TRX" and "LBD-TRX fusion protein" refer to a
polypeptide comprising: (1) a portion of a nuclear receptor (e.g., a steroid receptor superfamily 
member) which is capable of binding a predetermined ligand species, typically a physiological ligand 
(e.g., a steroid hormone), with an affinity of at least about 1 xtO' M and which, in the absence of 
bound ligand, can bind corepressor and/or fails to bind one or more coactivator species which can 
become bound in a ligand-dependent manner; and (2) a catalytically active portion of a transcription
factor DBD or AD. In some embodiments, a polypeptide spacer of about 1-25 amino acids may
separate the LBD portion from the TRX portion. Spacers are preferably noninterfering sequences (i.e.,
which permit the desired functioning of the LBD-TRX in a cell nucleus), and can be readily determined
and selected by the practitioner. In some embodiments, amino-terminal
extensions of additional amino acids in noninterfering sequences can be present. For typical LBD-TRX
species of the invention, the linear order of the LBD and TRX, from amino-terminus to carboxy-
terminus, can be varied so long as the desired function, ligand-inducible activation of binding to
coactivator and/or dissociation of corepressor with the LBD-TRX fusion protein, is retained.

As used herein, the term "cell surface reporter" refers to a protein which localizes to
the plasma membrane of a host cell and a portion of which is exposed to the extracellular region
(e.g., an integral membrane protein or a transmembrane glycoprotein, an engineered fusion between
a detection protein and a membrane tether sequence), wherein said extracellular portion can be
bound by a specific antibody or other ligand with an affinity of at least about 1x10' M ' and/or wherein
said extracellular portion can catalyze a detectable conversion of a substrate to a product. The term
cell surface reporter also refers to a polynucleotide sequence encoding such a cell surface protein.
Various cell surface proteins can be used as cell surface reporters, including fusion proteins
comprising at least one segment (e.g., an epitope of at least about 3 consecutive amino acids) that is
not present in a naturally-occurring cell surface protein. For example but not limitation, the following
may be used as cell surface reporters: a CD (cluster of differentiation) antigen present on cells of a
hematopoietic lineage (e.g., CD2, CD4, CD8, CD21, etc.), γ-glutamyltranspeptidase, an adhesion
protein (e.g., ICAM-1, ICAM-2, ELAM-1, VCAM-1), spike glycoproteins of enveloped viruses (e.g.,
glycoprotein H of human cytomegalovirus (hCMV)), and the membrane-bound form of an
immunoglobulin µ chain. Also for example but not limitation, a fusion protein between a
transmembrane protein and a β-galactosidase protein may compose a cell surface reporter (i.e., a
fusion protein having β-galactosidase sequences linked to a γ-glutamyltranspeptidase heavy subunit);
cells bearing such fusion proteins can be affinity enriched using an immobilized anti-β-galactosidase
antibody. A preferred cell surface reporter protein is a fusion of an enzyme, or a catalytically active
portion thereof, to a membrane tether sequence, particularly where the enzyme produces a detectable
conversion of substrate to product and wherein the host cell membrane is substantially impermeable
to said substrate (e.g., luciferase). Preferably, a cell surface reporter protein is a protein which is not
normally expressed at significant levels on the host cells. Since luciferase is an invertebrate protein,
fusion proteins comprising luciferase are preferred cell surface reporters for use with mammalian host
cells. Numerous other specific examples of suitable cell surface reporters, such as, for example, various
cloned CD ("cluster of differentiation") antigens, are known to those of skill in the art (e.g., from the
literature and GenBank™; such as CD2 GenBank sequence files: Humcd21, Humcd22, Humcd23,
Humcd24, and Humcd25, and Muscd21, Muscd22, Muscd23, Muscd24, and Muscd25) and may be
selected for use in the methods and polynucleotide constructs of the invention on the basis of the
practitioner's desired application.

As used herein, the term "expression cassette" refers to a polynucleotide comprising
a promoter sequence and, optionally, an enhancer and/or silencer element(s), operably linked to a
structural sequence, such as a cDNA sequence or genomic DNA sequence. In some embodiments,
an expression cassette may also include polyadenylation site sequences to ensure polyadenylation of
transcripts. If the encoded protein is to be secreted or retained in the cell membrane, a signal
sequence is generally encoded at the amino-terminal portion of the encoding sequence. When an
expression cassette is transferred into a suitable host cell, the structural sequence is transcribed from
the expression cassette promoter, and a translatable message is generated, either directly or following
appropriate RNA splicing. Typically, an expression cassette comprises: (1) a promoter, such as an
SV40 early region promoter, HSV tk promoter, phosphoglycerate kinase (pgk) promoter, CMV
promoter, SRα promoter, or other suitable promoter known in the art, (2) a cloned polynucleotide
sequence, such as a cDNA or genomic fragment ligated to the promoter in sense orientation so that
transcription from the promoter will produce an RNA that encodes a functional protein, and (3) a
polyadenylation sequence. For example and not limitation, an expression cassette of the invention
may comprise the cDNA expression cloning vectors pcD and λNMT (Okayama H and Berg P (1983)
Mol. Cell. Biol. 3: 280; Okayama H and Berg P (1985) Mol. Cell. Biol. 5: 1136, incorporated herein by
reference) or other preferred expression vectors known and available in the art. The transcriptional
regulatory sequences in an expression cassette are selected by the practitioner based on the intended
application; depending upon the specific use, transcription regulation can employ inducible,
repressible, constitutive, cell-type specific, developmental stage-specific, sex-specific, or other desired
type of transcriptional regulatory sequence.

As used herein, the term "reporter polynucleotide" refers to a polynucleotide
sequence comprising, in linear order, an operably linked transcriptional regulatory sequence and a
reporter cassette. In a variation, the reporter polynucleotide encodes a selectable marker which can
be selected for in order to ensure the presence of the reporter polynucleotide in the reporter host cells.
Preferred selectable marker genes include, but are not limited to, G-418 resistance, mycophenolic acid
resistance, DHFR, HSV-tk, and the like. Preferred reporter cassettes include sequences encoding a
protein which can be expressed on the cell surface of a metabolically active cell and detected without
resulting in non-viability of the cell; preferably such proteins can be readily cleaved from the cell
surface if desired so as to reduce background. A reporter polynucleotide may comprise tandem
arrays of genetic elements at the discretion of the practitioner.

As used herein, the term "reporter host cell" refers to a eukaryotic cell, preferably a
mammalian cell, which harbors a reporter polynucleotide, an LBD-TRX expression cassette, and a CA-
TRX expression cassette and/or a CR-TRX expression cassette. The reporter polynucleotide
sequence may reside, in polynucleotide linkage, on the same polynucleotide as one or more of the
expression cassette sequences, or may reside on a separate polynucleotide. The expression
cassettes and/or the reporter polynucleotides may be present as an extrachromosomal element (e.g.,
replicon), may be integrated into a host cell chromosome, or may be transiently transfected in non-
replicable, non-integrated form. Preferably, the expression cassettes and reporter polynucleotide are
both stably integrated into a host cell chromosomal location, either by nonhomologous integration or
by homologous sequence targeting.






The term "transcriptional modulation" is used herein to refer to the capacity to either
enhance transcription or inhibit transcription of a structural sequence linked in cis; such enhancement
or inhibition may be contingent on the occurrence of a specific event, such as stimulation with an
inducer, and/or may only be manifest in certain cell types. For example but not for limitation,
expression of a protein that prevents formation of an activated (e.g., ligand-bound) glucocorticoid
receptor will alter the ability of a glucocorticoid-responsive cell type to modulate transcription of a
glucocorticoid-responsive gene in the presence of glucocorticoid. This alteration will be manifest as
an inhibition of the transcriptional enhancement of the glucocorticoid-responsive gene that normally
ensues following stimulation with glucocorticoids. The altered ability to modulate transcriptional
enhancement or inhibition may affect the inducible transcription of a gene, such as in the just-cited
example, or may affect the basal level transcription of a gene, or both. For example, a reporter
polynucleotide may comprise a glucocorticoid-inducible enhancer-promoter driving transcription of a
sequence encoding a cell surface reporter protein. Such a reporter polynucleotide may be transferred to
a glucocorticoid-responsive cell line for use as a reporter host cell. Cloned sequences that silence
expression of the cell surface reporter in cells cultured in the presence of glucocorticoids also may be
included (e.g., to reduce basal transcription and ensure detectable glucocorticoid inducibility). Numerous
other specific examples of transcription regulatory elements, such as specific enhancers and silencers, are
known to those of skill in the art and may be selected for use in the methods and polynucleotide
constructs of the invention on the basis of the practitioner's desired application. Literature sources
and published patent documents, as well as GenBank™ and other sequence information data sources,
can be consulted by those of skill in the art in selecting suitable transcription regulatory elements and
other structural and functional sequences for use in the invention. Where necessary, a transcription
regulatory element may be constructed by synthesis (and ligation, if necessary) of oligonucleotides
made on the basis of available sequence information (e.g., GenBank sequences for a CD4 enhancer,
HSV-TK promoter, or a SV40 early promoter).

As used herein, "linked" means in polynucleotide linkage (i.e., phosphodiester
linkage) or polypeptide linkage, depending upon the context of usage. "Unlinked" means not linked to
another polynucleotide or polypeptide sequence; hence, two sequences are unlinked if each
sequence has a free 5' terminus and a free 3' terminus.

As used herein, the term "operably linked" refers to a linkage of polynucleotide
elements in a functional relationship. A nucleic acid is "operably linked" when it is placed into a
functional relationship with another nucleic acid sequence. For instance, a promoter or enhancer is
operably linked to a coding sequence if it affects the transcription of the coding sequence. Operably
linked means that the DNA sequences being linked are typically contiguous and, where necessary to
join two protein coding regions, contiguous and in reading frame. However, since enhancers
generally function when separated from the promoter by several kilobases and intronic sequences
may be of variable lengths, some polynucleotide elements may be operably linked but not contiguous.
A structural gene (e.g., a HSV tk gene) which is operably linked to a polynucleotide sequence
corresponding to a transcriptional regulatory sequence of an endogenous gene is generally expressed
in substantially the same temporal and cell type-specific pattern as is the naturally-occurring gene.

[...] referred to as the 5' direction. The direction of 5' to 3' addition of nascent RNA transcripts
is referred to as the transcription direction; sequence regions on the DNA strand having the same
sequence as the RNA and which are 5' to the 5' end of the RNA transcript are referred to as "upstream
sequences"; sequence regions on the DNA strand having the same sequence as the RNA and which are 3'
to the 3' end of the RNA transcript are referred to as "downstream sequences".

As used herein, the term "transcriptional regulatory sequence" refers to a
polynucleotide sequence [...] transcribable polynucleotide sequence. A positive transcriptional
regulatory element is a DNA sequence which activates transcription [...] sequences. Typically,
transcriptional regulatory sequences comprise a promoter, frequently [...] vectors, transcription rate
assays, and the like). Often, transcriptional regulatory sequences include a [...] factor recognition
site [...]. The term often refers to a DNA sequence comprising a functional promoter and any
associated transcription elements (e.g., enhancer, CCAAT box, TATA box, SP1 site, etc.) that are
essential for transcription of a [...].

The term "transcriptional enhancement" is used herein to refer to the functional property
of producing an increase in the rate of transcription of linked sequences that contain a functional
promoter.



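As used herein, the term "transcriptional unit" or "transcriptional complex" refers to a
polynucleotide sequence that comprises a structural gene (exons), a cis-acting linked promoter and
other cis-acting sequences necessary for efficient transcription of the structural sequences, distal
regulatory elements necessary for appropriate tissue-specific and developmental transcription of the
structural sequences, and additional cis sequences important for efficient transcription and
translation (e.g., polyadenylation site, mRNA stability controlling sequences).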



As used herein, the terms "transcription factor recognition site" and "transcription factor
binding site" refer to a polynucleotide sequence(s) or sequence motif(s) which are identified as being
sites for the sequence-specific interaction of one or more transcription factors, frequently taking the
form of direct protein-DNA binding. Typically, transcription factor binding sites can be identified by
DNA footprinting, gel mobility shift assays, and the like, and/or can be predicted on the basis of known
consensus sequence motifs, or by other methods known to those of skill in the art. For example, and
not to limit the invention, eukaryotic transcription factors include, but are not limited to: NFAT, AP1,
AP-2, Sp1, OCT-1, OCT-2, OAP, NFκB, CREB, CTF, TFIIA, TFIIB, TFIID, Pit-1, C/EBP, SRF (Mitchell
PJ and Tjian R (1989) Science 245: 371). For purposes of the invention, steroid receptors, steroid
superfamily receptors, other nuclear receptors, RNA polymerases, and other proteins that interact with
DNA in a sequence-specific manner and exert transcriptional regulatory effects are considered
transcription factors.

As used herein, the term "functionally expressed" refers to a coding sequence which
is transcribed, translated, post-translationally modified (if relevant), and positioned in a cell such that
the protein provides the desired function. With reference to a reporter cassette, functional expression
generally means production of a sufficient amount of the encoded cell surface reporter protein to
provide a statistically significant detectable signal to report ligand-induced transcriptional effect of a
reporter polynucleotide.

The term "agent" is used herein to denote a chemical compound, a mixture of
chemical compounds, an array of spatially localized compounds (e.g., a polynucleotide array
and/or combinatorial small molecule array), a biological macromolecule, a
bacteriophage peptide display library, a bacteriophage antibody (e.g., scFv) display library, a
polysome peptide display library, or an extract made from biological materials such as bacteria, plants,
fungi, or animal (particularly mammalian) cells or tissues. Agents are evaluated for potential activity
as nuclear receptor agonist agents by inclusion in screening assays of the methods described herein
or the like. Agents are evaluated for potential activity as specific nuclear receptor agonists by
inclusion in screening assays described hereinbelow. Agents may act directly, by serving as a ligand
to the LBD of the LBD-TRX fusion protein, and/or they may act indirectly, by producing a second
messenger which activates the LBD (e.g., a kinase which phosphorylates the LBD, a second hormone
which is a ligand of the LBD, and the like). A "test agent" is an agent which is being evaluated by the
methods of the invention to determine whether it possesses agonist activity, antagonist activity, or no
activity with regard to the LBD of the nuclear receptor.
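
The three outcomes named for a test agent (agonist activity, antagonist activity, or no activity) can be illustrated with a minimal sketch. The function name, the four readout arguments, and the two-fold threshold are hypothetical choices made only for illustration; the document does not prescribe a numerical rule, and a practitioner would normally apply a statistical criterion instead.

```python
def classify_test_agent(basal, agent_alone, agonist_alone, agonist_plus_agent,
                        min_fold=2.0):
    """Assign a test agent to one of three outcomes from reporter readouts.

    All arguments are reporter signals in the same arbitrary units:
      basal              - no agent, no known agonist
      agent_alone        - test agent only
      agonist_alone      - known agonist only
      agonist_plus_agent - known agonist together with the test agent
    min_fold is an assumed fold-change threshold, not a value from the text.
    """
    if basal <= 0:
        raise ValueError("basal readout must be positive")
    # Agonist: the agent by itself raises the readout above basal.
    if agent_alone >= min_fold * basal:
        return "agonist"
    # Antagonist: the known agonist works, and the agent opposes it.
    agonist_works = agonist_alone >= min_fold * basal
    agent_opposes = agonist_plus_agent * min_fold <= agonist_alone
    if agonist_works and agent_opposes:
        return "antagonist"
    return "no activity"


if __name__ == "__main__":
    print(classify_test_agent(1.0, 5.0, 6.0, 6.0))
    print(classify_test_agent(1.0, 1.0, 6.0, 2.0))
    print(classify_test_agent(1.0, 1.0, 6.0, 5.5))
```

A fold-change rule is used here only because it is easy to read; it ignores partial agonists and assay noise.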

As used herein, the term "test ligand" refers to an agent which has chemical properties
consistent with the agent serving as a ligand of a predetermined nuclear receptor LBD with an affinity
of at least about 1 x 10⁶ M⁻¹. Typically, these properties can include small size (e.g., MW less than
3,000 Daltons), hydrophobicity, structural similarity to known ligands of the receptor LBD, and/or
predicted interaction with an LBD binding pocket based on computer modeling.
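
As a rough illustration of the numerical part of this definition, the sketch below converts an association constant to a dissociation constant (an affinity of 1 x 10^6 M^-1 corresponds to a Kd of 1 micromolar) and applies the affinity and size figures as a simple filter. The class and function names are hypothetical, and treating molecular weight as a hard cutoff is an assumption, since the text lists small size only as a typical property.

```python
from dataclasses import dataclass

MIN_AFFINITY_PER_MOLAR = 1e6   # "at least about 1 x 10^6 M^-1" from the definition
MAX_MW_DALTONS = 3000.0        # "MW less than 3,000 Daltons", a typical property


@dataclass
class Candidate:
    name: str            # hypothetical label for the compound
    ka_per_molar: float  # association constant Ka, in M^-1
    mw_daltons: float    # molecular weight, in Daltons


def kd_molar(ka_per_molar: float) -> float:
    """Dissociation constant Kd (in M) is the reciprocal of Ka (in M^-1)."""
    if ka_per_molar <= 0:
        raise ValueError("Ka must be positive")
    return 1.0 / ka_per_molar


def looks_like_test_ligand(c: Candidate) -> bool:
    """Illustrative filter: affinity threshold plus the typical size bound."""
    return (c.ka_per_molar >= MIN_AFFINITY_PER_MOLAR
            and c.mw_daltons < MAX_MW_DALTONS)


if __name__ == "__main__":
    example = Candidate("compound-A", ka_per_molar=5e7, mw_daltons=420.0)
    print(kd_molar(example.ka_per_molar), looks_like_test_ligand(example))
```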
As used herein, the term "agonist" refers to an agent which produces activation of a 
nuclear receptor and/or which, when contacted with a nuclear receptor signal transduction system of
the invention, produces a substantial increase in binding of one or more coactivator protein(s) to an
LBD-TRX fusion protein, and/or relieves binding of one or more corepressors to the LBD-TRX fusion
protein.

As used herein, the term "antagonist" refers to an agent which opposes the agonist
activity of a known agonist of the nuclear receptor LBD, or enhances or stabilizes binding of a
corepressor protein to an LBD-TRX fusion protein.

"Physiological conditions" as used herein refers to temperature, pH, ionic strength,
viscosity, and like biochemical parameters that are compatible with a viable organism, and/or that
typically exist intracellularly in a viable cultured mammalian cell, particularly conditions existing in the
nucleus of said mammalian cell. For example, the intranuclear or cytoplasmic conditions in a
mammalian cell grown under typical laboratory culture conditions are physiological conditions.
Suitable in vitro reaction conditions for in vitro transcription cocktails are generally physiological
conditions, and may be exemplified by a variety of art-known nuclear extracts. In general, in vitro
physiological conditions can comprise 50-200 mM NaCl or KCl, pH 6.5-8.5, 20-45°C and 0.001-10 mM
divalent cation (e.g., Mg++, Ca++); preferably about 150 mM NaCl or KCl, pH 7.2-7.6, 5 mM divalent
cation, and often include 0.01-1.0 percent nonspecific protein (e.g., BSA). A non-ionic detergent
(Tween, NP-40, Triton X-100) can often be present, usually at about 0.01 to 2%, typically 0.05-0.2%
(v/v). Particular aqueous conditions may be selected by the practitioner according to conventional
methods. For general guidance, the following buffered aqueous conditions may be applicable: 10-250
mM NaCl, 5-50 mM Tris HCl, pH 5-8, with optional addition of divalent cation(s), metal chelators,
nonionic detergents, membrane fractions, antifoam agents, and/or scintillants.
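
The general in vitro ranges quoted above can be checked mechanically. The following is a minimal sketch, assuming a plain dictionary of conditions; the key names and the function are hypothetical, and only the salt, pH, and temperature ranges stated in the paragraph are encoded.

```python
# Ranges taken from the "in general" sentence above; the key names are illustrative.
IN_VITRO_RANGES = {
    "salt_mM": (50.0, 200.0),  # NaCl or KCl concentration
    "pH": (6.5, 8.5),
    "temp_C": (20.0, 45.0),
}


def out_of_range(conditions: dict) -> list:
    """Return the names of parameters that are missing or outside the ranges."""
    problems = []
    for key, (low, high) in IN_VITRO_RANGES.items():
        value = conditions.get(key)
        if value is None or not (low <= value <= high):
            problems.append(key)
    return problems


if __name__ == "__main__":
    # The "preferably about" conditions fall inside every range.
    print(out_of_range({"salt_mM": 150.0, "pH": 7.4, "temp_C": 37.0}))
    print(out_of_range({"salt_mM": 150.0, "pH": 9.0, "temp_C": 37.0}))
```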

As used herein, the terms "label" or "labeled" refer to incorporation of a detectable
marker, e.g., a radiolabeled amino acid or a recoverable label (e.g., biotinyl moieties that can be
recovered by avidin or streptavidin) or an epitope tag or linked catalytic activity. Recoverable labels
can include covalently linked polynucleobase sequences that can be recovered by hybridization to a
complementary sequence polynucleotide. Various methods of labeling polypeptides and
polynucleotides are known in the art and may be used. Examples of labels include, but are not limited
to, the following: radioisotopes (e.g., ³H, ¹⁴C, ³⁵S, ³²P, ¹²⁵I), fluorescent or phosphorescent labels (e.g.,
FITC, rhodamine, lanthanide phosphors), enzymatic labels (e.g., horseradish peroxidase,
β-galactosidase, luciferase, alkaline phosphatase), biotinyl groups, predetermined polypeptide epitopes
recognized by a secondary reporter (e.g.,
transcriptional activator polypeptide, metal binding domains, epitope tags). In some embodiments,
labels are attached by spacer arms of various lengths, e.g., to reduce potential steric hindrance. A
"labeled antibody" can be a primary antibody having an attached detectable label or a primary
antibody which can be detected by a secondary antibody having an attached detectable label.

As used herein, the term "statistically significant" means a result (i.e., an assay
readout) that generally is at least two standard deviations above or below the mean of at least three
separate determinations of a control assay readout and/or that is statistically significant as determined
by Student's t-test or other art-accepted measure of statistical significance.
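
The two-standard-deviation criterion in this definition can be sketched as follows. The function name is hypothetical, and the use of the sample standard deviation (rather than the population form) is an assumption, since the text does not say which is intended; the t-test alternative is not shown.

```python
from statistics import mean, stdev


def is_statistically_significant(readout, control_readouts, n_sd=2.0):
    """True if readout lies at least n_sd standard deviations from the control mean.

    control_readouts must hold at least three separate determinations,
    as the definition requires. stdev() is the sample standard deviation.
    """
    if len(control_readouts) < 3:
        raise ValueError("need at least three control determinations")
    control_mean = mean(control_readouts)
    control_sd = stdev(control_readouts)
    if control_sd == 0:
        # Identical controls: any departure from the mean counts.
        return readout != control_mean
    return abs(readout - control_mean) >= n_sd * control_sd


if __name__ == "__main__":
    controls = [10.0, 12.0, 14.0]   # mean 12, sample SD 2
    print(is_statistically_significant(17.0, controls))
    print(is_statistically_significant(13.0, controls))
```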

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Generally, the nomenclature used hereafter and the laboratory procedures in cell
culture, molecular genetics, and nucleic acid chemistry and hybridization described below are those
well known and commonly employed in the art. Standard techniques are used for recombinant nucleic
acid methods, polynucleotide synthesis, cell culture, and transgene incorporation (e.g.,
electroporation, microinjection, lipofection). Generally, enzymatic reactions, oligonucleotide synthesis,
and purification steps are performed according to the manufacturer's specifications. The techniques
and procedures are generally performed according to conventional methods in the art and various
general references which are provided throughout this document, as well as: Maniatis et al., Molecular
Cloning: A Laboratory Manual (1989) 2nd Ed., Cold Spring Harbor, N.Y.; and Berger and Kimmel,
Methods in Enzymology, Volume 152, Guide to Molecular Cloning Techniques (1987), Academic
Press, Inc., San Diego, CA, which are incorporated herein by reference. The procedures are believed
to be well known in the art and are provided for the convenience of the reader. All the information
contained therein is incorporated herein by reference.

Oligonucleotides can be synthesized on an Applied Bio Systems oligonucleotide 
synthesizer according to specifications provided by the manufacturer. 

All sequences referred to herein by GenBank database file designation (e.g.,
GenBank: Humatct4a) or otherwise obtainable by routine search of a publicly-available sequence
database or scientific publications are incorporated herein by reference and are publicly available,
such as by reconstruction of the sequence by overlapping oligonucleotides or other means.

All publications referred to are incorporated herein for all purposes, including as if the
publications, including any diagrams or figures, were reproduced herein.

Overview



In general, the invention encompasses methods, polynucleotide constructs, and host
cells which are employed for generating and screening agents for the purpose of identifying agents
which are agonists or antagonists of a predetermined nuclear receptor having a LBD which can, in a
first state unbound by agonist ligand, remain substantially unbound with regard to a coactivator
protein, typically a protein containing at least one -LXXLL- motif, and, in a second state bound by
agonist ligand, bind to said coactivator protein. In general, the methods employ a LBD-TRX fusion
protein to detect agonist ligands. Agonist ligands produce a conformational change in the LBD-TRX,
thereby activating a binding moiety that allows binding to a coactivator protein (or dissociation of a
corepressor protein), said binding moiety being otherwise substantially repressed by the unliganded
LBD. The ligand-induced LBD alteration causes binding of a CA-TRX to form a
transcriptionally active complex which can drive transcription of the reporter polynucleotide to produce
functional expression of the reporter cassette.

The invention separates the function of ligand binding from other functions of
transcription factors, coupling ligand binding to cofactor-binding activity which can be assessed by a
particularly advantageous means of measuring these inter-protein interactions.

Wagner et al. (1998) Mol. Cell. Biol. 18: 1369, Horwitz (1996) op.cit., Glass et al.
(1997) op.cit., and Krey et al. (1997) Mol. Endocrin. 35: 779, describe methods for identifying LBD
interactions with coactivators and corepressors in a ligand-dependent manner.

The present invention provides methods and compositions that provide a means for highly
sensitive, reliable, and automatable screening of a bank of agents to identify agonist agents and
antagonists which are able to conformationally activate the LBD of a nuclear receptor.

Nuclear Receptor LBDs

Nuclear receptors suitable for providing an LBD for the method are those receptors
which are localized in the nucleus and/or can translocate to the nucleus upon ligand binding. Such
nuclear receptors which have a regulable domain which represses an adjacent or closely linked
functional domain, wherein such repression is substantially relieved upon activation of the regulable
domain by an agonistic stimulus, such as by binding of an agonist ligand, are suitable for use. Often
the regulable domain binds a known ligand, such as a steroid hormone, retinoid, or other generally
hydrophobic small molecule ligand. Most usually, ligand binding and domain activation occur in a
stereospecific manner which is selective for particular ligand geometries and structural features.
Ribiero RC et al. (1995) op.cit. provides a general overview of some such nuclear receptors. Glass et
al. (1997) op.cit. also provides examples of nuclear receptors.

The list of known nuclear receptors includes, but is not limited to, the steroid receptor
superfamily (e.g., estrogen receptor (ER), progesterone receptor (PR), androgen receptor (AR),
glucocorticoid receptor (GR), retinoic acid receptor ("RAR"), retinoid-X receptor ("RXR"), vitamin D
receptor ("VDR"), peroxisome proliferator activation receptor ("PPAR"; such as, for example, PPARα,
PPARγ, and PPARδ), thyroid hormone receptor ("TR"), farnesoid receptor ("FXR"), insect ecdysone
receptor ("EcR"), retinoid-Z receptor ("RZR")), and other related receptor homolog genes as is known
in the art and/or can be determined by a skilled artisan by computerized homology searching of public
nucleic acid and protein sequence databases. Green et al. (1995) Mutat. Res. 333:101 discloses the
peroxisome proliferator activation receptor (PPAR). Greiner et al. (1996) PNAS 93:10105, Carlberg et
al. (1994) Mol. Endocrinol. 8:757, Medvedev et al. (1996) Gene 181:199, and Schrader et al. (1996) J.
Biol. Chem. 271:19732, disclose the RZR receptor. Kephart et al. (1996) Mol. Endocrinol. 10:408
discloses the vitamin D receptors (VDRs). Kozak et al. (1996) Mamm. Genome 7:164 discloses the
farnesoid receptor (FXR). The sequences of the steroid receptor superfamily members can be readily
obtained from public sequence databases and published scientific and patent literature. The
identification of the ligand binding domain (LBD) for each member is either published or can be
determined by a skilled artisan on the basis of sequence identity and structural homology to known
steroid superfamily receptors. Routine assay to determine the exact boundaries, if necessary, of the
LBD of a given nuclear receptor can be performed by straightforward deletion analysis and
measurement of ligand binding and/or retention of the ability of a fragment to confer ligand-induced
derepression of a linked SSR.

LBD-TRX Fusion Proteins 

Polypeptides comprising LBD-TRX fusion proteins which are ligand-activable
transcription partners with complementing hybrid CA-TRX or CR-TRX fusion proteins can be
expressed from a polynucleotide encoding a protein comprising a LBD of a nuclear receptor in
polypeptide linkage to a TRX, optionally separated by a polypeptide spacer of from 1 to about 25
amino acids of a non-interfering sequence.

In aspects of the invention, the LBD portion of the LBD-TRX fusion protein is a mutein
comprising at least one mutation as compared to the LBD sequence of the naturally occurring nuclear
receptor from which the LBD was derived.

Kellendonk et al. (1996) Nucleic Acids Res. 24:1404, Zhang et al. (1996) Nucleic
Acids Res. 24:543, and Metzger et al. (1995) PNAS 92:6991, report fusions of hormone-binding
domains to Cre recombinase to produce a hormone-inducible Cre recombinase. Logie C and Stewart
AF (1995) op.cit. report fusion of a hormone-responsive domain to FLP recombinase. Kolb AF and
Siddell SG (1996) Gene 183:1 also report enzymatically active fusions of the Cre recombinase. Thus,



WO 99/50664 

PCT/US99/07168 

-28- 

Reporter Polynucleotides

[...] 100 nucleotides or more than 250 nucleotides can be used at the discretion of the
practitioner. The reporter polynucleotide may be linked to the LBD-TRX (CA-TRX and/or CR-TRX)
expression cassette or it may be unlinked.

Reporter Proteins

[...] alkaline phosphatase, and [...] a eukaryotic cell plasma membrane is substantially
impermeable. That is, such a substrate is not taken up by such cells in sufficient amounts to generate
a readily detectable metabolic product that distinguishes cells expressing a reporter enzyme from
other cells. Such is the case for substrates of luciferase, alkaline phosphatase and beta-galactosidase.

One type of reporter protein comprises a sequence of a naturally-occurring cell
surface protein (e.g., human CD8, membrane-bound μ immunoglobulin, glycophorin A) or may be a
fusion protein (e.g., a β-galactosidase/γ-glutamyl transpeptidase fusion protein). Preferably, the cell
surface reporter protein either is a protein not normally expressed on the host cells or is heterologous
(i.e., from a different species), so that the cell surface reporter can be discriminated, for example with
a species-specific antibody, from cell surface proteins that naturally occur on the host cell surface.
More preferably, the cell surface reporter protein is both heterologous and a protein not normally
expressed on the host cell. For example, a mouse hepatoma cell does not normally express a
significant amount of CD8 protein, a protein characteristic of cytotoxic T lymphocytes. With reference
to the mouse hepatoma cell, a cell surface reporter comprising a human CD8 protein is both
heterologous and a protein not normally expressed on the cell type (i.e., hepatocyte). A structural
sequence of a cell surface reporter may comprise a polynucleotide sequence encoding a fusion
protein having a segment that does not occur naturally in eukaryotes (e.g., a bacterial sequence such
as β-galactosidase) fused, in correct reading frame, into an extracellular portion of a transmembrane
protein (e.g., human γ-glutamyltranspeptidase heavy subunit; GenBank: Humggt, Humggtx;
Goodspeed et al. (1989) Gene 76: 1, incorporated herein by reference).

Host cells 

Host cells typically include mammalian cells and other types of eukaryotic cells, such
as yeast, insect cells, and fungi. Immortalized host cell cultures amenable to transfection and in vitro
cell culture and of the kind typically employed in genetic engineering are preferred (e.g., HeLa cells,
KB cells, JW-2 cells, Detroit 6 cells, COS cells, CV-1 cells, VERO cells, and NIH-3T3 cells).
Embryonic cells used for generating transgenic animals are also suitable (e.g., zygotes and embryonic
stem cells). The cell type should be capable of expressing the reporter construct encoding a fusion
protein. Host cells are typically mammalian cells, and most preferably are mammalian cell lines, such
as HepG2, COS-7, CHO, HeLa, Hepa, NIH3T3, EBV-immortalized lymphocytes, or other cell lines
available in the art. Suitable cell lines and cell strains may be obtained from several sources,
including the American Type Culture Collection ("ATCC Catalog of Cell Lines and Hybridomas",
American Type Culture Collection, Rockville, MD, which is incorporated herein by reference).
Alternatively, it is possible to practice the invention with primary cell explants, such as lymphocytes,
hematopoietic stem cells, and primary hepatocytes.

Various host cells may be selected in the discretion of the practitioner. The selection
of a host cell generally is based upon the competence of the cell to support transcription from the
transcription regulatory sequence of the reporter polynucleotide.

A host cell which harbors a reporter polynucleotide and fusion protein expression
cassettes is a reporter host cell. A reporter host cell may harbor the reporter polynucleotide in any of
several formats, including: as a replicable episome, as a nonhomologously integrated transgene, as a
homologously targeted sequence, as an artificial chromosome, or as a transient non-replicable
polynucleotide. A reporter host cell can be readily identified as such by the detectable presence of a
reporter polynucleotide, which comprises a transcription regulatory sequence operably linked to a
reporter sequence and which, when considered as a unitary polynucleotide sequence spanning both
structural and regulatory elements, is not present in the host cell genome as a naturally-occurring
polynucleotide sequence.

Specific protein-protein interactions are fundamental to most cellular and organismal
functions. Polypeptide interactions are involved in formation of functional transcription complexes,
signal transduction pathways, cytoskeletal organization (e.g., microtubule polymerization), polypeptide
hormone receptor-ligand binding, organization of multi-subunit enzyme complexes, and the like.

Two-Hybrid Interaction Systems

Investigation of protein-protein interactions under physiological conditions has been
problematic. Considerable effort has been made to identify proteins that bind to proteins of interest.
Typically, these interactions have been detected by using co-precipitation experiments in which an
antibody to a known protein is mixed with a cell extract and used to precipitate the known protein and
any proteins which are stably associated with it. This method has several disadvantages, such as: (1)
it only detects proteins which are associated in cell extract conditions rather than under physiological
intracellular conditions, (2) it only detects proteins which bind to the known protein with sufficient
strength and stability for efficient co-immunoprecipitation, and (3) it fails to detect associated proteins
which are displaced from the known protein upon antibody binding. For these reasons and others,
improved methods for identifying proteins which interact with a known protein have been developed.

Two-Hybrid Systems

One approach has been to use a so-called "two-hybrid" system to identify polypeptide
sequences which bind to a predetermined polypeptide sequence present in a fusion protein (Chien et
al. (1991) Proc. Natl. Acad. Sci. (USA) 88: 9578). This approach identifies protein-protein interactions
in vivo through reconstitution of a transcriptional activator (Fields S and Song O (1989) Nature 340:
245), the yeast Gal4 transcription protein. The method is based on the properties of the yeast Gal4
protein, which consists of separable domains responsible for DNA-binding and transcriptional
activation. Polynucleotides encoding two hybrid proteins, one consisting of the yeast Gal4 DNA
binding domain fused to a polypeptide sequence of a known protein and the other consisting of the
Gal4 activation domain fused to a polypeptide sequence of a second protein, are constructed and
introduced into a yeast host cell. Intermolecular binding between the two fusion proteins reconstitutes
the Gal4 DNA-binding domain with the Gal4 activation domain, which leads to the transcriptional
activation of a reporter gene (e.g., lacZ, HIS3) which is operably linked to a Gal4 binding site.
Typically, the two-hybrid method is used to identify novel polypeptide sequences which interact with a
known protein (Silver SC and Hunt SW (1993) Mol. Biol. Rep. 17: 155; Durfee et al. (1993) Genes
Devel. 7: 555; Yang et al. (1992) Science 257: 680; Luban et al. (1993) Cell 73: 1067; Hardy et al.
(1992) Genes Devel. 6: 801; Bartel et al. (1993) Biotechniques 14: 920; and Vojtek et al. (1993) Cell
74: 205). However, variations of the two-hybrid method have been used to identify mutations of a
known protein that affect its binding to a second known protein (Li B and Fields S (1993) FASEB J. 7:
957; Lalo et al. (1993) Proc. Natl. Acad. Sci. (USA) 90: 5524; Jackson et al. (1993) Mol. Cell. Biol. 13:
2899; and Madura et al. (1993) J. Biol. Chem. 268: 12046). Two-hybrid systems have also been used
to identify interacting structural domains of two known proteins (Bardwell et al. (1993) Med. Microbiol.
8: 1177; Chakraborty et al. (1992) J. Biol. Chem. 267: 17498; Staudinger et al. (1993) J. Biol. Chem.
268: 4608; and Milne GT and Weaver DT (1993) Genes Devel. 7: 1755) or domains responsible for
oligomerization of a single protein (Iwabuchi et al. (1993) Oncogene 8: 1693; Bogerd et al. (1993) J.
Virol. 67: 5030). Variations of two-hybrid systems have been used to study the in vivo activity of a
proteolytic enzyme (Dasmahapatra et al. (1992) Proc. Natl. Acad. Sci. (USA) 89: 4159). Alternatively,
an E. coli/BCCP interactive screening system (Germino et al. (1993) Proc. Natl. Acad. Sci. (U.S.A.)
90: 933; Guarente L (1993) Proc. Natl. Acad. Sci. (U.S.A.) 90: 1639) can be used to identify
interacting protein sequences (i.e., protein sequences which heterodimerize or form higher order
heteromultimers).

Each of these two-hybrid methods relies upon a positive association between two Gal4
fusion proteins, thereby reconstituting a functional Gal4 transcriptional activator which then induces
transcription of a reporter gene operably linked to a Gal4 binding site. Transcription of the reporter
gene produces a positive readout, typically manifested either (1) as an enzyme activity (e.g.,
β-galactosidase) that can be identified by a colorimetric enzyme assay or (2) as enhanced cell growth on
a defined medium (e.g., HIS3). Thus, these methods are suited for identifying a positive interaction of
polypeptide sequences, but are poorly suited for identifying agents or conditions which alter (e.g.,
inhibit) intermolecular association between two polypeptide sequences.

Reverse Two-Hybrid Systems

A general method is provided, referred to herein as a reverse two-hybrid method,
wherein agents which disrupt an intermolecular association between two interacting polypeptides
(LBD and either a CA or CR domain) thereby generate a selectable and/or detectable readout (e.g.,
complementation of an auxotrophic phenotype, expression of a detectable reporter molecule, and the
like). Typically, a reverse two-hybrid method produces a positive readout under conditions wherein an
agent blocks or otherwise inhibits the intermolecular binding of the interacting polypeptides. A positive
readout condition is generally identified as one or more of the following detectable conditions: (1) an
increased transcription rate of a predetermined reporter gene, (2) an increased concentration or
abundance of a polypeptide product encoded by a predetermined reporter gene, typically such as an
enzyme which can be readily assayed in vivo, and/or (3) a selectable or otherwise identifiable
phenotypic change in an organism (e.g., yeast) harboring the reverse two-hybrid system. Generally, a
selectable or otherwise identifiable phenotypic change that characterizes a positive readout condition
confers upon the organism either: a selective growth advantage on a defined medium, a mating
phenotype, a characteristic morphology or developmental stage, drug resistance, or a detectable
enzymatic activity (e.g., β-galactosidase, luciferase, alkaline phosphatase, and the like). In this
manner, it is possible to efficiently identify agents (including but not limited to polypeptides, small
molecules, and oligonucleotides) which inhibit intermolecular binding between two predetermined
interacting polypeptides.

In an aspect of the invention, a reverse two-hybrid system is composed of: (1) a first
hybrid protein (e.g., an LBD-TRX), (2) a second hybrid protein (e.g., a CA-TRX or a CR-TRX) which
binds to the first hybrid protein under control conditions (e.g., physiological conditions in the absence
of agent), (3) a relay (or signal inverter) gene which is efficiently expressed as a consequence of the
first hybrid protein and the second hybrid protein being functionally bound to each other, and (4) a
reporter gene which is efficiently expressed when the product of the relay (or signal inverter) gene is
substantially absent and is either poorly expressed or not expressed when the relay (or signal inverter)
gene product is efficiently expressed. The first hybrid protein and second hybrid protein bind to each
other through interacting polypeptide segments (i.e., a portion of the first hybrid protein preferentially
binds to a portion of the second hybrid protein forming a heterodimer or higher order heteromultimer
comprising the first and second hybrid proteins; said binding portions of each hybrid protein are
termed "interacting polypeptide segments").
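
The four components listed above behave like a two-stage logic inverter. The following is a minimal sketch of that logic only, under the simplifying assumption that each expression state is either on or off; the function names are hypothetical and do not come from the specification.

```python
def reverse_two_hybrid_readout(hybrids_bound: bool) -> dict:
    """Boolean sketch of the relay (signal inverter) logic.

    hybrids_bound is True when the first and second hybrid proteins are
    functionally bound to each other (e.g., control conditions, no agent).
    """
    # Bound hybrids reconstitute a transcriptional activator, which
    # drives efficient expression of the relay gene.
    relay_expressed = hybrids_bound
    # The relay gene product represses the reporter, so the reporter is
    # efficiently expressed only when the relay product is absent.
    reporter_expressed = not relay_expressed
    return {"relay": relay_expressed, "reporter": reporter_expressed}


def agent_scores_positive(agent_disrupts_binding: bool) -> bool:
    """An agent gives a positive readout when it releases the reporter."""
    hybrids_bound = not agent_disrupts_binding
    return reverse_two_hybrid_readout(hybrids_bound)["reporter"]
```

In this simplified picture, an agent that dissociates the hybrid pair switches the relay off and the reporter on, which is why inhibition of binding appears as a positive readout.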

The first hybrid protein comprises: (1) a first interacting polypeptide sequence (LBD)
in polypeptide linkage with (2) a DNA-binding domain of a transcriptional activator protein or other
DNA binding protein (e.g., a repressor). The second hybrid protein comprises: (1) a second
interacting polypeptide sequence (CA or CR), capable of forming an intermolecular association with
the first interacting polypeptide sequence under control conditions (e.g., physiological conditions and
absence of agent), in polypeptide linkage with (2) an activation domain of a transcriptional activator
protein, whereby intermolecular binding between the first hybrid protein and the second hybrid protein
(via the interacting polypeptide sequences) unites the DNA-binding domain of the first hybrid
protein with the activation domain of the second, generating a transcriptional activator function.
Generally, the first hybrid protein and the second hybrid protein are encoded by polynucleotides which
are constitutively expressed in a host organism (e.g., a eukaryotic or prokaryotic cell, or multicellular
organism).

The relay gene (alternatively termed the signal inverter gene) is operably linked to a
transcriptional regulatory sequence (a "relay transcriptional regulatory sequence") which is positively
regulated by the transcriptional activator that is formed by the intermolecular binding of the first hybrid
protein to the second hybrid protein. Hence, when the first hybrid protein binds to the second hybrid
protein (via the interacting polypeptide sequences), the transcriptional activator formed thereby binds
to a transcriptional regulatory sequence operably linked to the relay gene and enhances the net
transcription of the relay gene. The relay gene encodes a protein that represses transcription of a
reporter gene. Thus, when the first and second hybrid proteins are functionally bound to each other,
the relay gene is expressed and thereby represses transcription of the reporter gene(s). In an
embodiment, such relay proteins are of the type often referred to in the art as "negative regulators of
transcription". In an embodiment of the invention, the relay gene is a negative regulator of
transcription in yeast; for example but not limitation, the GAL80 gene can serve as a relay gene in
yeast. In embodiments where host organisms are employed to harbor the reverse two-hybrid system,
the relay gene is often a gene which naturally occurs in the germline DNA of the host organism
species, and frequently can be an endogenous germline gene, or alternatively may be introduced into
the host organism as exogenous DNA, typically into a host genome that lacks the corresponding
functional endogenous gene (e.g., a "knockout background").

The reporter gene is operably linked to a transcriptional regulatory sequence
("reporter transcriptional regulatory sequence") which is negatively regulated by the gene product of
the relay gene and which is induced in the absence of the relay gene product. Thus, transcription of
the reporter gene is repressed in control conditions (e.g., physiological conditions in the absence of
agent) wherein the two hybrid proteins bind to each other and form a transcriptional activator that
increases transcription of the relay gene. Generally, the relay gene product either binds to the
transcriptional regulatory sequence operably linked to the reporter gene, or binds to a transcription
protein that binds to the transcriptional regulatory sequence operably linked to the reporter gene. The
net transcription rate of the reporter gene is reduced (or completely blocked) as a consequence of the
relay gene product binding to the reporter gene transcriptional regulatory sequence and/or to a
transcription protein required for constitutive expression of the reporter gene. Any of a variety of
reporter genes that produce a positive readout can be used. For example and not limitation, suitable
reporter genes [...] such as by in situ assay or the like. Suitable genes which confer a selectable
phenotype are [...] (e.g., yeast HIS3), genes which encode drug resistance (e.g., neo), genes which
induce cell [...] whose expression confers a selective advantage. Suitable
genes which encode a gene product which is conveniently detected in situ are exemplified by but not
[...] peroxidase, and the like.



The invention provides polynucleotides encoding a first hybrid protein and a second
hybrid protein [...] frame of polynucleotide sequences encoding one or more interacting polypeptide
sequence(s). Typically, a first polynucleotide will encode a first hybrid protein composed of a first
predetermined interacting polypeptide sequence and a DNA-binding domain of a transcriptional
activator, and a second polynucleotide will encode a second hybrid protein composed of a second
predetermined interacting polypeptide sequence and an activation domain of a transcriptional
activator; wherein the DNA-binding domain [...] hybrid protein [...] and form a [...] hybrid protein pair
are derived from the same naturally occurring transcriptional activator (e.g., Gal4) [...] transcription
activators which can [...] not occur in nature (e.g., a DNA-binding domain of the bacterial lexA protein
can be [...] transcriptional activator from the viral protein VP16; Vojtek et al. (1993) [...]).

[...] regulatory sequence operably linked to a relay (or signal inverter) gene. A relay (or signal inverter)
[...] usually, a protein [...] of transcription for a [...] the relay protein is a transcription repressor
protein that binds to a polynucleotide sequence and thereby inhibits transcription of a cis-linked and
operably linked sequence. In an alternative embodiment, the relay protein binds to a protein that is a
positive regulator of transcription of a predetermined gene or gene subset, and as a consequence of
binding thereby inhibits the transcriptional activity of the positive regulator. One variety of such a relay
protein binds to and blocks the activation domain(s) of transcriptional activators. Although a variety of
suitable relay proteins are apparent to those of skill in the art, this category of relay protein can be
exemplified by the mammalian mdm2 oncoprotein, which binds the transactivation domain of the tumor
suppressor protein p53, and the yeast Gal80 protein, which binds and inactivates the activation domain
of Gal4. In an embodiment, the relay protein comprises a mutation, addition, or deletion that reduces
the stability of the relay protein in vivo as compared to the naturally occurring cognate relay protein.

Relay proteins can be referred to as signal inverter proteins, as they serve to invert a positive
transcriptional signal (the reconstitution of a functional transcriptional activator by binding of the two
hybrid proteins) into a negative transcriptional signal, which reduces transcription of a predetermined
reporter gene. Generally, a polynucleotide encoding a relay protein is operably linked to a relay
transcriptional regulatory sequence that produces transcription of the relay gene dependent upon
functional reconstitution of the DNA-binding domain and activation domain of the two hybrid proteins.
For example and not limitation, such a relay transcriptional regulatory sequence can comprise a
promoter and a polynucleotide sequence comprising one or more site(s) which bind(s) a reconstituted
functional transcriptional activator formed by association of the two hybrid proteins; for example, if the
two-hybrid transcriptional activator comprises a lexA DNA-binding domain, the relay transcriptional
regulatory sequence operably linked to the relay gene can comprise one or more lexA binding site
sequences, arrayed in tandem.

Other aspects of two-hybrid systems are described in U.S. Patent 5,525,490,
incorporated herein by reference for all purposes. 


Other Aspects

The invention provides a method for identifying agonist ligands of a predetermined
nuclear receptor. The method comprises:

forming agent-treated host cells by contacting (1) a test agent with (2) a population of reporter
cells;

incubating the agent-treated host cells under incubation conditions for a suitable incubation
period sufficient for distribution of the test agent to the nucleus to allow for formation, if any, of
a liganded complex comprising an LBD-TRX fusion protein and a CA-TRX (or a CR-TRX)
fusion protein; and,

identifying as an agonist ligand a test agent which produces a statistically significant increase
in detectable expression of the reporter protein in the population of agent-treated host cells as
compared to a reference population of untreated host cells incubated under substantially
identical conditions and lacking said test agent.
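
The specification does not name a statistical test for the final step. As one possible illustration, the sketch below uses an exact one-sided permutation test on replicate reporter readings; the function names, the 0.05 threshold, and the choice of test are all assumptions made for this example.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, untreated):
    """One-sided exact permutation test for an increase in reporter signal.

    Returns the fraction of all relabelings of the pooled readings whose
    mean difference is at least as large as the observed difference.
    """
    observed = mean(treated) - mean(untreated)
    pooled = list(treated) + list(untreated)
    as_extreme = 0
    total = 0
    for idx in combinations(range(len(pooled)), len(treated)):
        chosen = set(idx)
        group_a = [pooled[i] for i in chosen]
        group_b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance guards against floating-point rounding.
        if mean(group_a) - mean(group_b) >= observed - 1e-12:
            as_extreme += 1
    return as_extreme / total


def is_candidate_agonist(treated, untreated, alpha=0.05):
    """Flag a test agent whose treated wells show a significant increase."""
    return permutation_p_value(treated, untreated) < alpha
```

An exact permutation test makes no distributional assumption, which suits small numbers of replicate wells, but it enumerates every relabeling and so is practical only for small replicate counts.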

In an aspect of the method, the test agent consists of at least one species of small
molecule (M.W. <3,000 Daltons). In an aspect of the method, the test agent is a lipophilic compound.
In an embodiment, the test agent is a steroid, retinoid, thyroxine analog, vitamin D derivative, or
polycyclic aromatic hydrocarbon. In an embodiment, untreated host cells are contacted with vehicle or
a known inert substance in place of the test agent. [...]
100 amino acids of a non-interfering amino acid sequence (which may be selected by the artisan and
determined empirically without necessitating undue experimentation) is in polypeptide linkage between
the LBD segment and the TRX segment and/or between the CA (or CR) segment and its linked TRX
segment, such as to reduce potential steric hindrance.






In an aspect, the method may be used to rank-order the efficacy and/or potency (or
other pharmacological property) of a collection of test agents for agonist or antagonist activity relative
to a predetermined nuclear receptor. A collection (or library) of test agents can be screened either by
parallel screening of individual test agent species or in pools of test agents having related structural
features, to determine the ability of each test agent species or pool thereof to elicit a dose-dependent
and/or time-dependent activation of the LBD-TRX in a population of reporter host cells. The method
typically employs a population of reporter host cells which are subdivided into a plurality of
subpopulations that are cultured under substantially identical conditions, such as for example in the
wells of a multiwell culture vessel (e.g., 96-well [...]
(e.g., wells) each having an approximately equivalent reporter host cell subpopulation exposed to
substantially identical conditions.

[...] or concentration of a test agent or a pool of test agents, such that, for a given test agent species or
pool, a plurality of dosage levels or concentrations are represented in the collection of culture vessels.
For example and not limitation, a 96-well plate may have each row representing a distinct test agent
species and each column representing a series of predetermined dosages or concentrations (or
predetermined dilution ratios of a stock test agent solution of unknown concentration). Following
addition of the test agent or pool, the culture vessels containing the reporter host cells exposed to the
test agent or pool are incubated under suitable incubation conditions (e.g., conventional cell culture
conditions) for a time period sufficient to produce activation of a nuclear receptor of the type from
whence the LBD portion of the LBD-SSR in the reporter host cells was obtained. Following or during
the incubation time period, the expression of cell surface reporter protein is detected from each culture
vessel and compared to the expression level, if any, of the cell surface reporter in a parallel culture
vessel containing reporter host cells cultured under substantially identical conditions but lacking a test
agent, but in some embodiments administered the vehicle used to deliver the test agent(s), as a
control to establish background reporter expression. If desired, the level of cell surface reporter
expression in each culture vessel can be detected and quantified at a plurality of time points, such as
to determine a time course of activity for each test agent species or pool. By comparing the amount of
cell surface reporter expression in each culture vessel, a dose-response curve (e.g., wherein
expression of the cell surface reporter is the response) and/or activation time curve can be calculated
for each test agent or pool to determine the pharmacological profile of each test agent or pool.
Exemplary pharmacological parameters can include, for example, ED50, binding constants, latency of
activation, and the like, among others. Each test agent species or pool can be rank-ordered relative to
other test agents or pools evaluated under substantially identical conditions on the basis of the
dose-response data thus obtained, and preferred test agents having desired pharmacological parameters
can be identified thereby.
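
The rank-ordering described above can be sketched as follows. This is an illustrative simplification, not the procedure of the specification: it assumes background-subtracted reporter signals, ascending positive doses, and an ED50 estimated by log-linear interpolation at half of the maximal observed response rather than by curve fitting. All names are hypothetical.

```python
import math


def estimate_ed50(doses, responses):
    """Estimate ED50 by log-linear interpolation at half-maximal response.

    doses must be ascending and positive; responses are background-
    subtracted reporter signals. Returns None if the half-maximal level
    is never crossed between two adjacent doses.
    """
    half_max = max(responses) / 2.0
    points = list(zip(doses, responses))
    for (d_lo, r_lo), (d_hi, r_hi) in zip(points, points[1:]):
        if r_lo < half_max <= r_hi:
            frac = (half_max - r_lo) / (r_hi - r_lo)
            log_lo, log_hi = math.log10(d_lo), math.log10(d_hi)
            return 10 ** (log_lo + frac * (log_hi - log_lo))
    return None


def rank_by_potency(plate):
    """Rank agents by estimated ED50, most potent (lowest ED50) first.

    plate maps an agent name to a (doses, responses) pair, e.g. one row
    of a multiwell plate. Agents with no estimate are omitted.
    """
    estimates = {name: estimate_ed50(d, r) for name, (d, r) in plate.items()}
    usable = [(name, ed50) for name, ed50 in estimates.items() if ed50 is not None]
    return sorted(usable, key=lambda item: item[1])
```

A fitted sigmoidal model would generally give better estimates when enough dose points are available; interpolation is used here only to keep the sketch dependency-free.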

The invention also provides kits comprising an expression polynucleotide encoding a
LBD-SSR fusion, a reporter polynucleotide of the invention, and, optionally, a suitable host cell, and
optionally, instructions for use. In a variation, the kit also comprises a second polynucleotide which
confers a selectable phenotype on host cells in which said second polynucleotide has undergone
site-specific recombination. In an embodiment, the kit comprises reporter cells suitable for use in test
agent screening assays to identify agonists and/or antagonists for ligands of the predetermined
nuclear receptor.

Although the present invention has been described in some detail by way of
illustration for purposes of clarity of understanding, it will be apparent that certain changes and
modifications may be practiced within the scope of the claims. The following experimental examples
are offered for illustration and are not intended to limit the invention in any way.

EXPERIMENTAL EXAMPLES

Overview

The use of nuclear receptor:coactivator interactions as a means for identifying and
categorizing nuclear receptor ligands to be used in high-throughput screening was evaluated. A direct
interaction assay suitable for in vitro use with enzymatic readout (ELISA) was constructed using fusion
proteins containing TR (thyroid hormone receptor) or ER (estrogen receptor) ligand-binding domains
(LBDs) and fragments of the coactivator protein, SRC-1. Specific, ligand-dependent interaction was
demonstrated.



Mammalian cell lines containing a two-hybrid system comprising a reporter gene and
two hybrid molecules: one containing a nuclear receptor LBD (TR or ER) fused to a VP16 acidic
activation domain, and the second containing coactivator protein fragments having at least one
leucine-charged domain (LCD) fused to a Gal4 DNA-binding domain, wherein the two hybrids can
[...] reporter gene. The third LCD of SRC-1 [...] TR
and ER LBDs, and that the second LCD interacts preferentially with ER rather than TR. In addition,
ligand-dependent dissociation of the TR LBD from fragments of the corepressor proteins SMRT and
NcoR was demonstrated in this format. No detectable interaction between these corepressor
fragments and the ER LBD was observed in the absence of ligand. Novel peptide motifs which
[...] random and focused (centered on the leucine charged motif, LXXLL) recombinant peptide diversity
libraries were screened and novel sequences which interact specifically with ER were identified;
estradiol increases the ability of these novel peptides to interact with the ER.

Example 1: Direct Interaction Assay

[...] format (ELISA) for measuring interaction of a nuclear receptor LBD with a binding fragment of steroid
receptor coactivator (SRC-1). [...] hormone receptor [...] fused in-frame to a glutathione S-transferase
sequence (GST) to form the binding member for the nuclear receptor LBD, which is immobilized on a
substrate, indicated [...] electrostatic binding to a plastic 96-well plate. The coactivator member
contains 3 leucine charged domains (LCDs) of SRC-1 fused in-frame to GST/MBP at the
amino-terminus and fused in-frame to an antibody 179 epitope tag at the carboxy-terminus. The LCD
sequences are shown. The SRC-1 fusion member that is bound to the immobilized LBD is detected
[...] specifically reacts with the 179 epitope and a second antibody (alkaline
phosphatase conjugated anti-mouse Ab). Fig. 2 shows the results of the direct interaction assay using
ER LBD interacting with SRC-1. 0.5 µg of purified GST-ER was used with varying amounts of crude
bacterial lysate containing GST-SRC-1. The interaction is specific (GST control) and enhanced by
estradiol. Fig. 3 shows the results of the direct interaction assay using ER LBD interacting with
SRC-1. 0.5 µg of purified GST-ER was used with varying amounts of crude bacterial lysate containing
GST-SRC-1. The interaction is specific (GST control) and enhanced by estradiol. Fig. 4 shows results
of a direct interaction assay of the TR and TR LBDs interacting with SRC-1. The indicated amounts
of GST fusion proteins were used. The interaction is specific and dependent on T3. Fig. 5
shows results of a direct interaction assay as a dose-response curve of T3 in promoting interaction
between the TR and TR LBDs and SRC-1. T3 appears more potent on the receptor. Direct
protein-protein interactions can be detected in cell-free systems in a variety of formats, including
phage-displayed proteins and purified proteins, among other formats evident to those skilled in the art.

Example 2: Positive Hybrid System:Coactivator

Fig. 6 shows a schematic portrayal of LBD-TRX, CA-TRX, and CR-TRX constructs
used and the reporter polynucleotide of a two-hybrid system. "Receptor" indicates LBD-TRX, with the
LBD exemplified by a generic nuclear receptor (NR) LBD with bound ligand (L), and the TRX domain
represented by the VP16 acidic activation domain (VP16). "Coactivator" indicates a CA-TRX, with
exemplary CA species being a portion of the SRC-1 protein, a portion of the CBP protein, or isolated
LCD sequences from those proteins; the TRX domain is exemplified as the Gal4 DNA binding domain
(DBD). "Corepressor" indicates a CR-TRX, with exemplary CR species being the interacting domains
(ID1 or ID2) from SMRT or NcoR fused to the TRX domain as exemplified by Gal4 DNA binding
domain (DBD). The reporter polynucleotide is exemplified as a UAS-TK-luciferase reporter containing
binding sites for Gal4 DBD. CHO cells were triple transfected with these constructs and hormone
added. Fig. 7 shows data from a positive two-hybrid system as a dose-response curve showing that
reporter (luciferase) is generated only when all three components (VP16ER, Gal4-SRC-1, and
UAS-TK-luciferase) are transfected together, and that the response is dose-dependent with respect to
estradiol concentration. Fig. 8 shows data from a positive two-hybrid system as a dose-response
curve showing that reporter (luciferase) is generated only when all three components (VP16TR,
Gal4-SRC-1, and UAS-TK-luciferase) are transfected together, and that the response is dose-dependent
with respect to T3 concentration.

Example 3: Positive Hybrid System:Corepressor

Fig. 9 shows coactivator/corepressor interactions with TR LBD in the positive
two-hybrid system. NcoR ID1, and to a lesser extent SMRT ID1, interact with TR LBD, but neither ID2
interacts strongly with the LBD. T3 induced dissociation of the corepressor fragments at significantly
lower concentrations than is required to recruit a coactivator. Fig. 10 shows interactions with ER LBD
in the positive two-hybrid system. NcoR ID1, and to a lesser extent SMRT ID1, interact with ER LBD,
[...]

Example 4: Identification of Novel [...]

Fig. 11 shows a schematic portrayal of a method for identifying novel LCD [...]
[...] GST-ER in the presence of estradiol in four rounds of enrichment [...]
[...] or random (-XXXXX-) 15mer library in the presence of ER and estradiol.

Fig. 13 shows the sequences of LacI-fused peptides obtained by screening the library [...]
[...] Control is SRC-1 fragment (597-781) containing the 3 LCDs. Note the estradiol [...]

In order to more accurately characterize a subset of the peptides selected from
[...] chromatography [...]
assays (see below). Binding of purified SRC-1 [...]
amounts of immobilized target receptor, the signal intensity with the SRC-1 NRID (SRC1+2+3) in the
ELISA varied little from receptor to receptor and that MBP alone gave consistently low signals. We
therefore analyzed both natural and library-selected peptide fusions in the context of the specific
signal obtained with SRC1+2+3. At the concentration of MBP fusions used, differences in the ability
of relatively high affinity proteins (such as the NRID fragments) to interact with the immobilized
receptors are not readily apparent. Therefore, for the NRID fragments the differences seen in the
M2H assay are less apparent; however, the lowest signal relative to SRC1+2+3 is with SRC2 and ER.
This is consistent with the M2H assay, where the difference in luciferase signal between SRC1+2+3
and SRC2 is greatest for ER. The actual differences in affinity between the NRID fragments are
relatively small compared to differences seen in NR-box 19mer peptides. These results indicate that
the interpretation of the minimum requirements for cofactor/NR interaction is likely highly assay-type
dependent.

Testing the MBP fusions containing natural coactivator-derived peptides by ELISA
revealed striking differences in signal intensity between different peptides and between different
receptors. As expected, SRC2 bound well to all four receptors while SRC1 bound less well to all four
but especially poorly to the TRs. SRC3 also bound less well than SRC2 but preferred the TRs over
the ERs as expected. Similar results were obtained with the other two SRC-1 family members;
however, NR-box #3 of RAC-3 appears to bind to the ERs more strongly than to the TRs.

Analysis of purified MBP-peptides derived by selection from the peptide library
indicated that there is general ER and TR selectivity of peptides selected on these two receptors
respectively. The degree of selectivity predicted by this assay, however, is not as great as that
predicted from the ELISA done with crude bacterial lysates. For instance, one ligand (M 153) is quite
ER selective in the assay using a crude lysate, while the purified MBP-peptide fusion appears equally
effective at interaction with both ER and TR. This could be due to the nature of the assay as
suggested above; in this case the absolute amount of the MBP fusion in the assay is likely far greater
when using purified proteins, thus somewhat driving low affinity interactions. Although these peptides
were selected with the [...]-forms of ER and TR, to a large extent the selectivities extend to the [...]-form of
the respective receptor, although some differences (for instance ligand M-693, TR vs. TR) suggest
that [...]/[...]-selective peptides could be identified.

Example 6: Determination of receptor/coactivator-mimetic peptide interaction affinities by radio-competition assay

In order to assess the interactions of the natural and selected sequences by a more
quantitative methodology, we set up a radio-competition assay using immobilized receptor LBDs and
iodinated SRC-1 NRID (SRC1+2+3) as the tracer. We set up a homogeneous binding assay based
on scintillation proximity (SPA). Control experiments used unlabeled tracer as the competitor. Even
in the absence of estradiol, ER interacts specifically and saturably with the NRID. The IC50 for
SRC1+2+3 is 21 nM without ligand but decreases ~5-fold to 4.3 nM upon the addition of estradiol. In
addition, as expected from the ELISA experiments, the amount of tracer bound to the immobilized
receptor increases dramatically with ligand addition. TR does not interact measurably with the NRID
in the absence of ligand, consistent with our observation of consistently much lower signals with TR in
ELISAs as compared to ER in the absence of ligand. Addition of ligand to the TR SPA results in the
expected association of the 125I NRID and an IC50 of 5.1 nM for unlabelled SRC1+2+3.
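
The approximately 5-fold shift quoted above follows directly from the two ER IC50 values reported (21 nM without ligand, 4.3 nM with estradiol). A short arithmetic check, with a hypothetical helper name:

```python
def fold_change(ic50_without_ligand_nm, ic50_with_ligand_nm):
    """Fold decrease in IC50 on ligand addition (values in nM)."""
    return ic50_without_ligand_nm / ic50_with_ligand_nm


# ER values reported in the text for unlabeled SRC1+2+3.
shift = fold_change(21.0, 4.3)
print(f"{shift:.1f}-fold")  # prints "4.9-fold"
```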

We next used the MBP fusions described in the previous section as competitors in
both the ER-NRID and TR-NRID SPAs in the presence of estradiol and triiodothyronine respectively.
To ensure that affinity and selectivity trends we observed were not artifacts of any one protein
preparation, we re-isolated a second batch of several of the MBP fusions and repeated all
measurements. Although there are some quantitative differences between the two protein
preparations, the results lead to the same conclusion in each case. As expected, the IC50 values for
SRC1+2+3 and SRC2+3 are nearly identical (3-11 nM) for the TR assay, and these protein fragments
are more potent than any natural or synthetic single NR-box-containing peptide assayed on this
receptor. SRC1+2 and SRC2, either as a peptide (19mer) or fragment (~56mer), are less potent,
consistent with the M2H assays above. Homologous peptides derived from the three SRC-1 family
members behave in a similar fashion with TR. The order of affinity of the three NR-boxes in each
case is #2 > #3 > #1, suggesting that, at the level of NRID - AF2 interactions, TR displays no
preference for any one of the three SRC-1 family members. Consistent with our M2H experiments,
the amino-terminal NR-boxes of CBP and p300 have relatively poor affinity for the TR. While many of
the TR-selected peptides have IC50 values similar to the best natural peptides, e.g., SRC2, the
ER-selected peptides have low affinity for TR. The fold selectivity of the TR-selected peptides for the TR
is always >2, except for one ligand (M-655), which we have shown by ELISA to bind to both TR
and ER. In contrast, with few exceptions, most of the natural coactivator-derived peptides have
receptor selectivities of only 1-2 fold. A greater fold selectivity of SRC3 for TR and TIF1 or RAC1 for
ER is likely given the ELISA and M2H results.
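
The text does not spell out how fold selectivity is computed. One common convention, assumed here, is the ratio of the IC50 on the non-selected receptor to the IC50 on the selected receptor. The sketch below applies that assumed definition; the peptide names and IC50 values are invented placeholders, not data from the specification.

```python
def fold_selectivity(ic50_target_nm, ic50_off_target_nm):
    """Assumed definition: off-target IC50 divided by target IC50.

    A ratio above 1 means the peptide competes more potently on the
    target receptor than on the other receptor.
    """
    return ic50_off_target_nm / ic50_target_nm


def selective_peptides(ic50_table, target, off_target, threshold=2.0):
    """Return peptides whose fold selectivity for target exceeds threshold."""
    hits = {}
    for name, ic50s in ic50_table.items():
        ratio = fold_selectivity(ic50s[target], ic50s[off_target])
        if ratio > threshold:
            hits[name] = ratio
    return hits


# Hypothetical IC50 values in nM, for illustration only.
example = {
    "peptide_A": {"TR": 50.0, "ER": 400.0},
    "peptide_B": {"TR": 120.0, "ER": 150.0},
}
print(selective_peptides(example, target="TR", off_target="ER"))
```

With these placeholder numbers only `peptide_A` clears the 2-fold threshold, mirroring the kind of cutoff the text applies to the TR-selected peptides.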






The interactions of the natural coactivator-derived peptides or SRC-1 fragments with
ER are slightly different than with TR, as expected. While SRC1+2+3 still has the lowest IC50, there
is little difference between the values obtained for SRC1+2 and SRC2+3 on ER, in contrast to those
obtained for TR. SRC2, in fragment or peptide form, has lower affinity than the fusion proteins
containing more than one NR-box, as predicted by M2H assay. In addition, the order of affinity of the
three NR-boxes in each SRC-1 family member for ER is #2 > #3 = #1. From the data it appears that
TIF-2 and RAC-3 were more effective coactivators of ER-dependent transcription than SRC-1, as the
affinities of the #1 and #3 NR-boxes of these proteins are significantly higher than those of the
corresponding NR-boxes of SRC-1. While previous studies using yeast two-hybrid systems have
shown that the ER LBD is able to interact with the amino terminus of CBP (Kamei 1996) or with the
amino terminal CBP NR-box (albeit poorly compared to the SRC NR-boxes, Heery 1997), our data
suggest that these interactions are likely of quite low affinity. Peptides we selected using ER as the
panning reagent show a relatively high degree of selectivity for ER over TR. As with the TR-selected
peptides and TR, the affinity of the ER-selected peptides for ER is similar if not slightly
higher than the best natural NR-boxes. In addition, similar to the TR-selected peptides, these
individual peptides do not have affinities for ER that are as high as any of the SRC-1 fragments
containing more than one LXXLL motif. The experiments above demonstrate that screening peptide
libraries with individual receptors is a successful method for obtaining receptor-selective peptides with
affinities equal to or greater than the relatively non-selective natural NR-box-derived sequences.

The foregoing description of the preferred embodiments of the present invention has
been presented for purposes of illustration and description. They are not intended to be exhaustive or
to limit the invention to the precise form disclosed, and many modifications and variations are possible
in light of the above teaching.

All publications and patent applications herein are incorporated by reference to the
same extent as if each individual publication or patent application was specifically and individually
indicated to be incorporated by reference.

Such modifications and variations which may be apparent to a person skilled in the
art are intended to be within the scope of this invention.

CLAIMS



1. A positive hybrid nuclear receptor signal transduction system, comprising an
intact eukaryotic host cell, comprising: (1) a LBD-TRX polynucleotide sequence encoding and
expressing a ligand-activatable fusion protein which comprises a ligand binding domain ("LBD") of a
predetermined nuclear receptor covalently linked to a transcriptional activator domain ("AD") of a
predetermined transcription factor, (2) a CA-TRX polynucleotide sequence encoding and expressing a
coactivator fusion protein which comprises a domain of a nuclear receptor coactivator protein ("CA")
capable of binding to said LBD of said predetermined nuclear receptor linked to a DNA-binding
domain of a predetermined transcription factor, and (3) a reporter polynucleotide sequence
comprising, in linear order, a transcriptional regulatory sequence which is responsive to said
predetermined transcription factor and a reporter cassette encoding a sequence that confers a signal
or detectable phenotype.

2. A positive hybrid nuclear receptor signal transduction system comprising an
intact eukaryotic host cell, comprising: (1) a LBD-TRX polynucleotide sequence encoding and
expressing a ligand-activatable fusion protein which comprises a ligand binding domain ("LBD") of a
predetermined nuclear receptor covalently linked to a transcriptional activator domain ("AD") of a
predetermined transcription factor, (2) a CR-TRX polynucleotide sequence encoding and expressing a
corepressor fusion protein which comprises a domain of a nuclear receptor corepressor protein ("CR")
capable of binding to said LBD of said predetermined nuclear receptor linked to a transcriptional
activator domain of a predetermined transcription factor, and (3) a reporter polynucleotide sequence
comprising, in linear order, a transcriptional regulatory sequence which is responsive to said
predetermined transcription factor and a reporter cassette encoding a sequence that confers a signal
or detectable phenotype.

3. A nuclear receptor signal transduction system, comprising a reverse hybrid
reporter host cell containing: (1) an LBD-TRX polynucleotide sequence encoding a fusion protein
comprising a ligand-binding domain of a nuclear receptor in polypeptide linkage with a DBD or AD of a
transcription factor; (2) a CR-TRX polynucleotide sequence encoding and expressing a corepressor
fusion protein ("CR-TRX") which comprises a domain of a nuclear receptor corepressor protein ("CR")
capable of binding to said LBD of said predetermined nuclear receptor linked to a transcriptional
activator domain of a predetermined transcription factor (TRX), and (3) a relay (or signal inverter) gene
encoding a protein which is efficiently expressed as a consequence of the LBD-TRX binding to the
CR-TRX as a transcriptionally active complex, and (4) a reporter gene comprising, in linear order, a
transcriptional regulatory sequence responsive to said transcriptionally active complex, and a reporter
cassette encoding a reporter, wherein the reporter gene is efficiently expressed when the product of
the relay (or signal inverter) gene is substantially absent and is either poorly expressed or not
expressed when the relay (or signal inverter) gene is efficiently expressed.

4. A multiplexed format assay for identifying nuclear receptor ligands comprising 
at least two of the following: (1) positive hybrid system:coactivator; (2) positive hybrid 
system:corepressor; (3) reverse hybrid system:coactivator; (4) reverse hybrid system:corepressor; (5) 
direct interaction assay; or (6) other art-known assay for identifying and/or quantifying ligand efficacy 
and/or potency as an antagonist or agonist of nuclear receptors. 

5. A method for identifying a candidate pharmaceutical agent from a library of 
test agents, wherein the candidate pharmaceutical agent has a desired biological effect profile, 
comprising: (1) performing n (where n is a number greater than 2, preferably greater than 3, and less 
than 10 billion) distinct assays of the invention individually using each discrete test agent (which may 
be a mixture) of the library so as to obtain for each individual test agent measurements of at least n 
biological effects as detected as a ligand-induced conformational change or a binding interaction 
change in a discrete assay, (2) for each biological effect detected, assigning a score value (binary or 
quantitative) based upon the detection (or lack thereof) of a ligand-induced conformational change or 
binding interaction in each assay, separately, to generate a score matrix ("bioeffect fingerprint") for 
each agent, and (3) comparing each agent's score matrix to an equivalent score matrix for one or 
more predetermined agonist(s) and/or antagonist(s), and thereby identifying agents having score 
matrices substantially similar to the score matrix of said predetermined agonist(s) or antagonist(s). 
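
The scoring and comparison steps recited in claim 5 can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration only: the claim does not specify a similarity measure, so cosine similarity is swapped in, and the function names, the binarization threshold, and the 0.9 cutoff for "substantially similar" are hypothetical.

```python
from math import sqrt


def fingerprint(readouts, threshold=None):
    """Turn n assay readouts for one agent into a score vector.

    With no threshold the scores stay quantitative; with a threshold
    (a hypothetical cutoff) each score becomes binary: 1.0 if the effect
    was detected, else 0.0. Readouts must be in a fixed assay order.
    """
    if threshold is None:
        return [float(x) for x in readouts]
    return [1.0 if x >= threshold else 0.0 for x in readouts]


def cosine_similarity(a, b):
    """Cosine similarity between two score vectors of equal length.

    Assumption: the claim names no metric; cosine is one possible choice.
    """
    if len(a) != len(b):
        raise ValueError("fingerprints must cover the same assays")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an all-zero fingerprint matches nothing
    return dot / (norm_a * norm_b)


def similar_agents(library, reference, min_similarity=0.9):
    """Return (agent, similarity) pairs at or above the cutoff, best first."""
    hits = []
    for name, scores in library.items():
        s = cosine_similarity(scores, reference)
        if s >= min_similarity:
            hits.append((name, s))
    return sorted(hits, key=lambda h: h[1], reverse=True)


# Three assays (n = 3); the reference is a known agonist's binary fingerprint.
reference = fingerprint([0.9, 0.1, 0.8], threshold=0.5)
library = {
    "agent_A": fingerprint([0.7, 0.2, 0.9], threshold=0.5),
    "agent_B": fingerprint([0.1, 0.8, 0.2], threshold=0.5),
    "agent_C": fingerprint([0.9, 0.9, 0.9], threshold=0.5),
}
hits = similar_agents(library, reference)
```

In this toy library only agent_A shares the reference agonist's pattern of detected effects, so it is the only agent returned; a quantitative fingerprint would be compared the same way by omitting the threshold.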

6. A polypeptide comprising a binding amino acid sequence that is: (1) non-naturally 
occurring in a nuclear protein, and (2) predetermined to bind to a nuclear receptor, typically 
by interaction with a binding interface of a nuclear receptor at which naturally-occurring coactivators 
and/or corepressors bind. 

7. A polypeptide of claim 6, wherein said polypeptide comprises a sequence 
selected from the group of sequences shown in Fig. 16. 

8. An agonist ligand of a nuclear receptor identified by the process of screening 
a library of test agents with a positive hybrid nuclear receptor system, a reverse hybrid nuclear 
receptor system, or a combination thereof, and selecting agonist ligands. 

9. An agonist ligand of claim 8, wherein the process comprises the further step 
of screening said library with a direct interaction assay and selecting agonist ligands which are 
detected as positives in each screening step. 